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IMPROVED PRODUCTION OF 
BACTERIAL STRAINS CROSS REFERENCE 
TO RELATED APPLICATIONS 

S 

This application claims priority to U.S. Provisional Application 60/416,167, filed 
i. October 4, 2002 and to U.S. Provisional Application 60/374,931, filed October 4, 2002 both 

of which are hereby incorporated by reference. 

to 

FIELp OFTHE INVENTION 

The present invention relates to alteration of metabolic pathways in bacteria) strains 
and particularly in B. cofh Mora specifically the invention pertains to manipulating global 
is regulatory genes, such as arcA and rpoB in bacterial host cells to enhance the production 
of desired products. 

BACKGROUND OF THE INVENTION 

20 Production of chemicals through cultivation of microorganisms is generally 

performed under well-controlled conditions wherein a constant environment regarding 
temperature, oxygen concentration and pH are maintained. In addition, culture vessels, 
which include the microorganisms, are fed with different typ^s of nutrients that cells use for 
growth and the production of desired chemical compounds. Under these controlled 

25 conditions, certain metabolic pathways and physiological responses programmed into a 
cell's genome are unnecessary, and in fact, may interfere with the optimized performance 
of a preferred process. In bacterial cells a number of reactions in these metabolic pathways 
and physiological responses can be eliminated or enhanced to improve the production of 
desired products. 

m Figure 1 1llustrates a number of pathways and reactions involved in the assimilation 

of glucose. Enhanced efficient glucose assimilation can result in increased energy 
production, such es ATP, NADH, NADPH and FADH, for use by a bacterial strain. The 
increased energy production can then result In an increase In production and/or growth per ' 
unit weight of generating blomase or per unit weight of provided carbon substrate. 

35 Generally , increased glucose assimtetlcnri arwJ metabolite production in 

microorganisms by genetic engineering has involved inactivation or deregulation of specific 
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enzymes; overexpression of genes that are Involved In central metabolic routes, such as 
glycolysis and the TCA cycle; the introduction of heterologous genes; and various other 
approaches. However, microbial strains regulate their physiology not Just by controlling 
specific steps in a pathway, but also by coordinating a number or blosynthette and catabolic 

s pathways through the use of global regulators. One of these global regulatory systems Is 
the ArcB/ArcA regulatory system. Another global regulatory system is the RpoS system. 

In particular, In facultative anaerobic bacteria, such as E co//, some of the enzymes 
of the tricarboxylic acid (TCA) cycle are repressed under anaerobic conditions and 
particularly when the microorganisms are grown on glucose. The response of the TCA 

10 cycle enzymes in the absence to oxygen is regulated mainly at the transcriptional level by 
the ArcB/ArcA two-component regulatory system {NCBI entries NP 418818 {AE0OO400) 
and NP 417677 (AE000510)). ArcB (histidine kinase) senses the absence of oxygen and 
subsequently phosphorylates the regulator ArcA* The phosphorylated-ArcA then binds to 
regulatory regions, which are operabfy linked to the coding regions of TCA cycle regulated 

i* enzymes and either activates*or*represses transcription. Examples of enzymes regulated in 
this manner Include succinate dehydrogenase, 2-oxoglutarate dehydrogenase, isocrtrate 
dehydrogenase, and citrate synthase, which are all repressed by phosphorated ArcA. 
(Lynch et al, (1996) in regulation of Gene Expression in Escherichia cou. Eds. Lin et 
al., Chapman and Hall, Naw York). In arcA mutants, TCA cycle genes that were repressed 

2D by arcA become derepressed. (luchi et aL. (1988) PNAS USA 85:1888-1892). Prohi et at. 
compared the production of products and specific enzyme activities In an £. co// wild-type 
strain and an £ coJS arcA mutant strain, wherein the a/cA was inactivated when the strains 
were grown on glucose or glycerol in the presence of oxygen or nitrate. (Prohi et aL, (1 998) 
Arch. MforobfoL 1 70:1-7). The authors concluded that the strong repression of the TCA 

as cycle during nitrate respiration (anaerobic conditions) occurs only during growth on glucose 
but not on other substrates, such as glycerol. However, deletion of arcAdid not have any 
affect on E. con strains when grown under aerobic conditions. 

RpoS genes encode DNA-dependent RNA polymerase sigma subunits and when 
bound to the core RNA polymerase, allows the transcription of a subset of genes. For 

90 example in E. col/, rpoS is Induced under various stress conditions and controls the 

expression of a catalase gene (katE) and glycogen biosynthesis, RpoS Is believed to be a 
global regulator of the general stress response In bacteria and can be triggered by many 
different stress signals. Often, activation of rpoS is accompanied by a reduction or 
cessation of bacterial growth. This mechanism provides the host microorganism with the 

35 ability to survive the encountered stress as well as potential other stresses. (Hengge - 
Aronls, R. (2002) MhrobtoL Mol. BloL Rev. 66:373-395). 
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The major function of the general stress response is preventive. There are more 
than 70 rpoS dependent genes known and the majority of these confer resistance against 
oxidative stress, near-UV radiation, potential*/ lethal heat shock, hyperosmolar^, acidic pH 
and organic solvents. Compared with wild-type rpoS strains, rpoS mutants were unable 
s able to induce the general stress response In bacteria and could not survive prolonged 
starvation periods (Hengge-Aronis R. (2002) MfcrobtoL Mol. BhL &ev. 66:373-395). In 
recombinant bacterial strains, the production of protein during culture, and particularly 
during late stage fed-batch fermentation, may result in a general stressful condition, which 
may induce the RpoS-dependent response, tt has been demonstrated that in continuous 
10 culture, at low dilution rates of, tor example 0.13rr\ a /poS" strain produced more 

recombinant protein than the wild-type strain. However, the effect was promoter dependent, 
and at higher dilution rates (0.38h* 1 ) the mutation had no effect on the production of 
recombinant protein. (Chou et ah, (1996) Bfotechnoi. Bioeng. 50:636-642). It is believed the 
effect of a rpoS mutation on the production of metabolites has not been reported. 
a The phosphoenolpyruvate carboxylase (Ppc) errzyme (E.C. 4.1.1.31/32/36/49) 

catalyzes the conversion of phosphoenolpyruvate (PEP) to oxaloaoetate (OAA). Ppc 
replenishes the TCA cycle by directly providing OAA. which may have been removed for 
other biosynthetic reactions, rf OAA is not replenished, the efficiency of the TCA cycle may 
be diminished. £ Coti cells having an inactivated ppc (ppc~) grew slower than wild-type - 
20 ppc cells (ppc*), and the ppc* cells used glucose about three times as efficiently as the 
ppc" cells. However, despite this effect, the ppc" ceils overproduced phenylalanine (Phe). 
The ppc~ cells produced at least 16 times more Phe than the ppc ceils. The production of 
unwanted by-products, such as acetate and pyruvate were also stimulated in the ppc 
ceils, (Miller et at., (1987) MicrobtoL 2:143-149). In another study the overproduction of Ppc 
25 in E. coll was effected by cloning the ppc In a multicopy plasmid under a tac promoter 
resulting in an increase in growth yield on glucose as well a decrease in the secretion of 
organic acids. (Uao et al., (1993) AppL environ, Miorobfoh 59:4261-4265). The above 
results suggest that the wild-type level of Ppc is not optimal for the most efficient glucose 
utilization in batch cultures- Overexpression of Ppc is expected to favor the production of 
3D amino acids of the OAA family, such as Asp. Met, tys. Thr and He. 

By modification of a global regulator such as ercA or rpoS and optionally by 
overexpression of ppc, a more efficient metabolic pathway for glucose assimilation may be 
achieved in a bacterial microorganism. 
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RRlgP SUMMARY OF THE INVENTION 

In one embodiment the invention provides a genetically engineered microorganism 
comprising an Inactivated endogenous global regulator selected from the group of arcA 

a genes or rpoS genes and optionally an exogenous promoter operably linked to a 

polynucleotide encoding a protein having phosphoenolpyruvate (PEP) carboxylase activity, 
wherein the engineered microorganism is capable of enhanced production of biornaterials, 
enhanced cell growth and/or enhanced cell productivity. The genetically modified 
microorganism encompassed by the invention may further include inactivation of i) an 

10 endogenous gene encoding a protein having phosphogluconate dehydratase (Edd) activity, 
II) an endogenous gene encoding a protein having phosphoacetylklnase (Pta) activity. Hi) 
an endogenous gene encoding a protein having acetate kinase (AckA) activity and/or iv) an 
endogenous gene encoding a protein having methygloxyal synthase (MgsA) activity. 

One aspect of the invention pertains to a method of enhancing the production of a 

is desired product In a bacterial host cell which comprises modifying a bacterial host cell by 
inactivating an endogenous aroA gene and cufturing the modified bacterial host cell in 
suitable culture media comprising glucose under aerobic conditions to allow production of a 
desired product. In one embodiment the bacterial host cell is from a strain of the 
Enterobacteriaoeae family. Preferably the host cell is an E. coil or Pentoea cell, and 

20 preferably the host cell Is a PTS~7GkT cell. In a second embodiment of this aspect, the 
desired product is glycerol, PEP, pyruvate, chorismate, ethanol, succinate or 
dihydroxyacetone-P. Preferably, the desired product is chorismate. According to the 
method of this aspect, the chorismate may be further converted to an aromatic amino acid. 
In a third embodiment of this aspect, the method further comprises inactivating the 

2s expression of an endogenous gene encoding a polypeptide having RpoS activity, Edd 
activity, Pta activity, AckA activity or MgsA activity. In a fourth embodiment of this aspect, 
.-the method further comprises transforming the bacterial host cell with e DMA fragment 
comprising an exogenous promoter, wherein the DNA fragment including the exogenous 
promoter is integrated Into the host ceil chromosome and replaces the endogenous 

io promoter which is operably linked to a PEP carboxylase coding sequence wherein PEP 
carboxylase is overexpressed. In a fifth embodiment, the method further comprises 
isolating the desired product from the culture media. 

A second aspect the Invention pertains to a method of enhancing blomass 
production in bacterial host cells comprising modifying a bacterial host cell by inactivating 

as an endogenous arcA gene and culturing the modified bacterial cell under suitable culture 
conditions wherein said culture conditions Include aerobic fermentation and glucose as a 
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carbon source and wherein blomass production is enhanced in the modified bacterial cell 
compared to blomass production in a corresponding non-modified bacterial cell cultured 
under essentially the same conditions. In one embodiment of the method according to this 
aspect further comprises inactivating an endogenous /poS gene. In a second embodiment 
s the endogenous arcA gene is inactivated by a deletion. In a third embodiment, the method 
further comprises inactivating an endogenous gene encoding a polypeptide having 
phosphogluconate dehydratase activity, phosphotransacetylase activity, acetyl kinase 
activity or methylglyoxyal synthase activity. Preferably the endogenous gene encoding a 
polypeptide having phosphogluconate dehydratase activity is an edd gene; the endogenous 
10 gene encoding a polypeptide having phosphotransacetylase activity Is a pta gene; the 
endogenous gene encoding a polypeptide having acetyl kinase activity is an ac/cA gene; 
and the endogenous gene encoding a polypeptide having methylglyoxyal synthase activity 
is a rngsA gene. In a fourth embodiment the method further comprises isolating the 
modified bacterial cell. Preferred host cell according to this aspect are Escherichia cells, 
i* Pantoea cells, Klebsiella cells, Giuconobacter cells and Erwfofa cells end most preferably 
E. coli cells and Pantoea cells. 

A third aspect of the Invention pertains to the modified bacterial ceils obtained 
according to the method of aspect one or two. 

In a fourth aspect the Invention pertains to genetically engineered bacterial strains 
20 of the Enterobacteriacea family comprising an inactivated endogenous arcA gene and an 
overexpressed polypeptide having PEP carboxylase activity. Preferably the bacterial strain 
Is an Escherichia, Pantoea; Klebsiella, Qluconobaoter or Ervwnra strain and particularly a 
strain B, coli. In a further embodiment, the endogenous arcA gene Is deleted. In yet another 
embodiment the engineered bacterial strain will comprise an inactivated endogenous 
2s rngsA; an inactivated endogenous ecW; and/or an Inactivated endogenous rpoS. In one 
preferred embodiment the engineered bacterial strain will include an overexpressed 
polypeptide having PEP carboxylase which Is operably linked to an exogenous promoter. 
In a preferred embodiment the exogenous promoter is a Gl promoter. 

In a fifth aspect the invention pertains to a genetically engineered bacterial strain 
ao comprising an Inactivated endogenous rpoS gene. In one embodiment the endogenous 
rpoS gene is deleted. In another embodiment the engineered strain further comprises an 
overexpressed polypeptide having PEP carboxylase activity- In yet another embodiment, 
the engineered bacterial strain has a PTS~7Glu* phenotype, which was derived from a 
bacterial strain originairy capable of utPizing a PTS for carbohydrate transport Preferably 
38 the bacterial strain is £. coli strain. 
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In a sixth aspect the invention pertains to a method of enhancing the production of 
an aromatic amino acid in an E. coli host cell comprising, modifying an EjsoB host oell by 
Inactivating an endogenous aroA gene and culturing the modified host cell in suitable 
culture media comprising glucose under aerobic conditions to allow production of an. 
5 aromatic amino acid. In a preferred embodiment the £col/ host cell is a PTS /Gtu + cell, in 
another embodiment the endogenous arc* gene is inactivated by a deletion, in a further 
embodiment the invention relates to the modified E cotf host cell obtained according to the 
method of this aspect. 

" BRIEF DESCRIPTION OF THE DRAWINGS 



Figure 1 A Is a schematic representation of interconnected metabolic routes, 
which are involved in glucose assimilation. The numbers represent key reactions in 
is glucose assimilation. Reactions 1 - 3 comprise the most efficient pathway for glucose 
assimilation. Reaction 4 Is the sum of reactions 1-4. Reaction 5 - 8 are alternative 
routes In glucose assimilation and these routes are in general either less efficient in 
the assimilation of glucose as compared to routes 1 -3 or bypass the formation of 
Important metabolic Intermediates. 
29 Figure 1B further illustrates the numbered reactions of figure 1 A and include 

glycolysis (reaction 1) and the Entner-Doudoroff pathway (reaction 6). 

Figure 2 depicts a general_scheme for the Synthesis and detoxification of 
methylglyoxai (encoded by mgsA) In £ cc//. 

Figure 3 sets forth the nucleotide sequences (SEQ ID NOs. 1 - 18) of 
w oligonucleotide primers used to either construct gene disruption or to confirm gene 
disruption as further discussed in the examples, wherein 

SEQ ID NO* 1 is the nucleotide sequence^of ArcAl ; 
SEQ ID NO. 2 )s the nucleotide sequence of ArcA2; 
SEQ ID NO. 3 is the nucleotide sequence of ArcA3; 
jo SEQ ID NO. 4 is the nucleotide sequence of ArcA4; 

SEQ ID NO. 5 is the nucleotide sequence of Edd1; 
SEQ ID NO. 6 is the nucleotide sequence of Edd2; 
SEQ ID NO. 7 is the nucleotide sequence of Edd3; 
SEQ ID NO. 8 rs the nucleotide sequence of Edd4; 
as SEQ ID NO. 9 is the nucleotide sequence of DackA-F; 

SEQ ID NO. 10 is the nucleotide sequence of DptaR; 
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SEQ ID NO. 1 1 Is the nucleotide sequence of AckU; 
SEQ ID NO. 12 Is the nucleotide sequence of AckD; 
SEQ ID NO. 13 Is the nucleotide sequence of MgsA1; 
SEQ ID NO- 14 is the nucleotide sequence of MgsA2; 

9 SEQ ID NO, 15 Is the nucleotide sequence of MQSA3; 
SEQ ID NO- 16 Is the nucleotide sequence of MgsA4; ' 
SEQ ID NO. 17 is the nucleotide sequence of PpcR; 
SEQ ID NO. 18 is the nucleotide sequence of PpcF; 

SEQ ID NO. 19 is the nucleotide sequence of the 1.6 G! promoter; 

10 SEQ ID NO. 20 is the nucleotide sequence of the short 1 ,6Gl promoter, 
SEQ ID NO. 22 Is the nucleotide sequence of the 1 .5 G1 promoter; 
SEQ ID NO. 23 is the nucleotide sequence of the primer, gapArRI ; 
SEQ ID NO. 24 Is the nucleotide sequence of the primer. gapA-^2; 
SEQ ID NO. 25 is the nucleotide sequence of the primer, gapA-R3; and 

a SEQ ID NO. 26 is the nucleotide sequence of the primer f gapA-RS. 

Figure 4A refers to example 1 and illustrates the change in optical density 
(ODboo). glucose concentration (g/L) end acetate concentration (g/L) in a wild-type 
E.coti strain MG1655 grown in TM2 media over time (hns)- 

Figure 4B Illustrates the change In optical density (ODeoo). glucose 
20 concentration (g/L) and acetate concentration (g/L) in a wild-type £co/f strain MG1655 
arcA~ strain (AaroA) grown in TM2 media over time (hrs) as mora fuUy described fn 
example 1. - - - - • • — 

Figure 5 illustrates the activity In an E coli strain comprising a deletion in a 
rpoS gene 385(rpoS-/4S) compared to a wild-type E. coll strain 336(KLP23/48) In 
zs fermentation media over time as more fully described in example 2_ 

-Figure 6 Illustrates the nucleotide sequence of plasmid pSYCO101 (SEQ ID 
- - .w-NO>*24-)^ -.--v- — • v. — ■ ■«>.* * 

Figure 7 illustrates a plasmid map of pSYCO 101, wherein DAR1 (dihydroxyacetone 
phosphate reductase) and GPP2 (glycerol-phosphate phosphatase) are glycerol pathway 
jo genes: STR(R) is a speotinomydn resistance encoding gene; pSClOl ori is an origin of 
replication of the plasmid; AspAterm is an asperate ammonia lyase gene terminator; 
dhaB1-3, dhaX and orf W, K Y are 1,3-propanedloi pathway genes; Thr atten is an E. coli 
threonine operon attenuator; TonB term is an E coli tonB gene terminator;" and trc is the 
trc promoter. 

35 
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DET AILED. DE SCRIPTION OF THE INVENTION 

The practice of the present invention will employ, unless otherwise Indicated, 
conventional techniques of molecular biology (including recombinant techniques), 
* microbiology, cell biology, and biochemistry, which are within the skill of the art. such 
techniques are explained fully in the literature, such as, MOLECULAR CLONING; A 
LABORATORY MANUAL, second edition (Sambnook et al. v 1989) CokJ Spring Harbor 
Laboratory Press; Current Protocols in Molecular Biology (P.M. Ausubel et aL, eds., 
1987 and annuel updates); Oligonucleotide Synthesis (M.J, Gait, ed. r 1984); PCR: The 

10 POLYMERASE CHAIN REACTION, (Mullis et aL. eds M 1 994); MANUAL OF INDUSTRIAL 

MICROBIOLOGY AND BlOTECHNOLOGY l Second Edition (A.L. Demaln, etal., eds. 1999); 
MANUAL OF METHODS FOR GENERAL BACTERIOLOGY (PhFllipp Gerhardt, R. G. E. Murray, 
Ralph N. Costiiow, Eugene W. Nester, Willis A. Wood, Noel R. Krieg and G. Briggs Phillips, 
eds), pp. 210-213 American Society for Microbiology, Washington, DC. and 

is BIOTECHNOLOGY: A TEXTBOOK OP INDUSTRIAL MICROBIOLOGY, (Thomas D, Brock) Second 
Edition (1989) Sinauer Associates, Inc., Sunderland, Mass. 

A, Dgfipjtions r 

Unless defined otherwise herein, all technical terms and scientific terms used herein 
20 have the same meaning as commonly understood by one of ordinary skiO in the art to which 
this invention pertains. Singleton, et aL K Dictionary OF Microbiology and MOLECULAR 
BIOLOGY, 2D ED., John Wiley and Sons, New York (1994) and Hale and Marham, The 
Harder DICTIONARY OF BIOLOGY. Harper Perennial, New York (1991) provide one of skill 
with general dictionaries of many of the terms used 3n this invention. Numeric ranges are 
2s Inclusive of the numbers defining the range. 

Unless otherwise indicated nucleic acids are written left to right in the 5 1 to 3* 
orientation and amino acid sequences are written left to right in amino to carboxy 
orientation, respectively. 

The headings provided herein are not limitations of the various aspects or 
30 embodiments of the invention which can be had by reference to the specification as a 
whole. Accordingly, the terms defined immediately below are more fully defined by 
reference to the specification as a whole. 

The references, issued patents and pending patent applications cited herein are 
incorporated by reference into this disclosure. 
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The term "global regulator* refers to a regulatory protein or RNA sequence that 
controls at least two Operons, wherein each operon contains genes involved in different 
physiological roles or metabolic pathways. 

The term "operon" refers to a group of genes which are transcribed as a single 
5 transcriptional unit and are subject to coordinated regulation by a common regulatory . 
sequence. 

By a "gene" is meant a segment of DNA Involved In the encoding for regulatory 
RNA's, transfer RNA's> ribosomal RNA's, promoter regions operably linked to the 
expression of a peptide, polypeptide or protein, including the coding region, non-coding 

10 regions preceding Header 11 ) and following ("trailer) the coding region, as well as 
intervening non-coding sequences C , int^orrs ,, ) between individual coding segments 
("exons"). Coding refers to the representation of amino acids, start and stop signals in a 
three base "triplet" code. 

As used herein a "regulatory sequence* Is a nucleic add sequence operably linked 

is to a cooing sequence to effect expression of the coding sequence. The regulatory 
sequence can Inhibit, repress or promote the expression of the operably linked coding 
sequence or translation of the rnRNA. Preferably, the regulatory sequence has a positive 
activity- i.e. binding of an endogenous Rgand (e.g. 3 transcription factor) to the regulatory 
sequence increases transcription, thereby resulting in increased expression of the 

20 corresponding target gene. In such a case, interference with transcription by binding a 
polyamide to a regulatory sequence would reduce or abolish expression of a gene. 
Regulatory sequences Include promoters, enhancers, silencers, operators and transcription 
(actors. 

A "promoter" is a regulatory sequence that is involved in binding RNA polymerase to 
m Initiate transcription of a gene. A promoter will comprise two consensus sequences 
corresponding tea -35 homology box and a -10 homology box with a linker sequence 
located between the -36 box and -10 box. In one embodiment the -35 homology box 
corresponds to TTGACA and the -10 homology box corresponds to TATAAT. Promoters 
are often located upstream (6 P ) to the transcription Initiation site of an operabry linked gene, 
ao A promoter may also include or be adjacent to a regulatory sequence known in the art as a 
silencer. A silencer sequence generally has a negative regulatory effect on expression of 
the gene. 

An "exogenous promoter" as used herein refers to non-native promoters, synthetic 
promoters, and modified naturally occurring promoters. Modified naturally occurring 
sis promoters are promoters that have been derived from endogenous promoters of a host cell, 
wherein the endogenous promoter ore derivative thereof Is reintroduced into the host cell. 



PAGE 1 6/72 * RCVD AT 7/8/2004 2:51 :58 PM [Eastern Daylight Time] * SVR:llSPTO-EFXRF-1/5 * DNIS:8729306 * CSID:202 457 0107 * DURATION (mm-ss): 29-32 



JUL-08-2004 15:07 fiJINOMOTO CORP SERU LLC 202 457 0107 P. 17/72 

y -f — f — {_ » i i * \j unm I »t»vJ7T» \ VTw /N^> -. .- - — '-■ 



WO 2004/033421 PCT/US2003/0 J 1445 

-10- 

Enhancers are nucleic acid regulatory sequences that can increase the utilization of 
promoters, and can function in either orientation or 3-5 1 ) and In any location 
(upstream or downstream) relative to the promoter. 

The term "operably linked' refers to a juxtaposition wherein the elements are in an 
a arrangement allowing them to be functionally related- For example, a promoter Is operebly 
linked to a structural sequence tf rt functions as a specific binding sits for RNA polymerase. 
Additionally a promoter is operabry linked to a coding sequence if tt controls the 
transcription of the sequence and a ribosoma binding site is operably linked to a coding 
sequence if it is positioned so as to permit translation of the mRNA, 
10 a "structural sequence" refers to a putative polynucleotide sequence of DMA that 

encodes a protein. Messenger RNA is its primary product 

The terms H arcA gene" and "a/cA ft used Interchangeability herein refer to a 
polynucleotide which encodes the structural gene of ArcA or a homoiogue, fragment 
derivative or complement thereof and having the arcA gene's function . 
« The terms "rpoS gene" and "rpo& m used interchangeability herein refer to a 

polynucleotide which encodes or comprises the rpoS RNA or a homoiogue, fragment, 
derivative or complement thereof and having the rpoS gene's function. 

The terms "edd gene* and "edcT used Interchangeabitrty herein refer to a 
polynucleotide which encodes or comprises the edd RNA or a homoiogue, fragment, 
20 derivative or complement thereof and having the edd gene's function. 

The terms ~pta gene" and "pfa" used Interchangeable herein refer to a 
• polynucleotide which encodes or comprises the pteRNA or a homoiogue, fragment, 
derivative or complement thereof and having the pta gene's function. 

The terms "ackA gene" and *ackA" used interchangeably herein refer to a 
25 polynucleotide which encodes or comprises the ©ckA RNA or a homoiogue. fragment, 
derivative or complement thereof and having the ackA gene's function. 

— - - T rie-terms "mgsA geneS-and "msgA" used interchangeably herein refer to a 

polynucleotide which encodes or comprises the msgA RNA or a homoiogue, fragment, 
derivative or complement thereof and having the msgA gene's function. 
30 The terms "ArcA" and "polypeptide having ArcA activity 11 refer to an aerobic 

respiration control protein which is a global regulatory protein encoded by the E coti arcA 
gene or a polypeptide having substantial homology thereto and having fi/rA regulatory 
activity. 

The terms "RpoS 11 and "polypeptide having Rpos activity" refer to a polypeptide 
35 encoded by the E coli /poS gene or a polypeptide having substantial homology thereto and 
having rpoS regulatory activity. 
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The terms "E-dd" or "polypeptide having phosphogluconate dehydratase activity" 
referto a polypeptide encoded by the E. coll odd gene ore polypeptide having substantial 
homology thereto and having phosphogluconate dehydratase activity. Typical of Edd is EC 
4-Z1.12. 

& A "polypeptide having phosphogluconate dehydratase activity" refers to polypeptide 

which catalyzes the conversion of gluconate-6-phosphate to a) pyruvate and b) 
g|yceratfehyde-3~phosphate in the catabollsm of gluconate. 

The terms "Pta* and "polypeptide having phosphotransaeetylase activity* refers to a 
polypeptide encoded by the E. cotipte gene or a polypeptide having substantial homology 
to thereto and having phosphotransaeetylase activity. Typical of Pta is E.C. 2.3.1 .8. 

A "polypeptide having phosphotransacetylaee activity* refers to a polypeptide which 
Is capable of converting acetyl coenzyme A to acetate-phosphate. 

The terms Wok" and "polypeptide having acetate kinase activity" refer to a 
polypeptide encoded by the E coU ackA gene or a polypeptide having substantial 
is homology thereto and having acetate kinase activity. Typical of Ack is EC. Z7.2.1 . 

A "polypeptide having acetate kinase activity" refers to a polypeptide capable of 
converting acetyl phosphate to acetate or the reverse. 

The terms "MgsA" and "polypeptide having methylgloxyel synthase" refer to a 
polypeptide encoded by the B. call mgsA. gene or a polypeptide having substantial 
20 homology thereto and having methylgloxyal synthase activity. Typical of MgsA is EC 
4.2.3.3. 

♦ • -A "polypeptide having methylgloxyal synthase activity" refers to a polypeptide 
capable of converting dlhydroxy acetone phosphate into methylgfyoxal and inorganic 
phosphate. 

25 * "The terms "phosphoenolpyruvate (PEP) carboxylase" and "Ppc" refer to a protein 

that catalyzes the conversion of PEP to H2O + CO* to phosphate + oxaloacetate acid 
_*(OAA). typical of Ppc is EC 4.1. 1.31. . . ^ . ... 

The term "endogenous" v/hen referring to a gene or regulatory sequence, means 
that the gene or regulatory sequence originates from within the organism. 
30 The term "exogenous" when referring to a gene or regulatory sequence, means 

that the gene or regulatory sequence originates from outside the organism, e.g.. from 
another strain, species, and or genera. 

The term "homologous" when referring to a gene or polynucleotide sequence, 
means a sequence having functional equivalence to the sequence being referred to. The 
35 homologous sequence regulates the same encoded sequence or regulates a sequence 
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that encodes a polypeptide having the same enzymatic activity as the reference sequence 
even though It may not have 100% amino acid identity. 

The terms "Viorvnjnctlonar, inactivated" and "inactivation* when referring to an 
operon, gene, a global regulatory sequence or a protein means that the known norma! 

6 function or activity of the operon, gene, global regulatory sequence or protein has been 
eliminated or highly diminished* Inactivation which renders the operon, gene, global 
regulatory sequence or protein non-functional Includes any means such as but not limited 
to deletions, mutations, substitutions, Interruptions or insertions. 

Deletions of an operon, gene or global regulator sequence may include deletion of 

id the entire sequence of one or more genes, deletion of part of the sequence of one or more 
genes, deletion of the regulatory region, deletion of the translations! signals and deletion of 
the coding sequence Including flanking regions of one or more genes. 

The terms "polynucleotide" and "nucleic add", used interchangeably herein, refer to 
a polymeric form of nucleotides of any length, either ribonucleotides or 

is deoxyribonudeotides. These terms include a single-, double- or triple-stranded DNA, 
genomic DNA, cDNA, RNA, DNA-RNA hybrid, or a polymer comprising purine and 
pyrimidine bases, or other natural, chemically, biochemically modified, non-natural or 
derivatized nucleotide bases. The following are non-limiting examples of polynucleotides: 
a gene or gene fragment exons, Introns, mRNA, tRNA, rRNA, ribozymes, cDNA, 

20 recombinant polynucleotides, branched polynucleotides, plasm Ids, vectors, isolated DNA of 
any sequence, Isolated RNA of any sequence, nucleic acid probes, and primers. A 
polynucleotide may comprise modified nucleotidesrsuch as methylated nucleotides and 
nucleotide analogs, uracyl, other sugars and linking groups such as fluororibose and 
thioate, and nucleotide branches. The sequence of nucleotides may be interrupted by non- 
29 nucleotide components. 

A nucleic acid is said to "encode" an RNA or a polypeptide if, in its native state or 
when manipulated- by-methods known to those*of skillJn the art, it can be transcribed and/or 
translated to produce the RNA, the polypeptide or a fragment thereof. The anti-sense 
strand of such a nucleic acid Is also said to encode the sequences. 

*> As is known in the art, a DNA can be transcribed by an RNA polymerase to produce 

RNA, but an RNA can be reverse transcribed by reverse transcriptase to produce a DNA. 
Thus a DNA can encode a RNA and vice versa. 

As used herein, the term "vector 41 refers to a polynucleotide construct designed to 
Introduce nucleic acids into one or more cell types. Vectors include cloning vectors, 

35 expression vectors, shuttle vectors, plasmids, cassettes and the like. 
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DNA construct refers to a nucleic acid sequence or polynucleotide generated 
recomblnantly or synthetically for example by In vitro PCR or other suitable techniques, the 
DNA construct is used to introduce nucleic acid Into a host cell. The DMA construct can be 
incorporated into a plasmld, chromosome or nucleic acid fragment The term DNA construct 
a may be used interchangeably with DNA cassette, expression cassette and other 
grammatical equivalents. 

An enzyme is *overexpressed w In a host cell if the enzyme is expressed in the celt at 
a higher levei that the level at which it is expressed In a corresponding wild-type cell. 

As used herein the term "gene" means a DNA segment that is involved in producing 
10 a polypeptide and includes regions preceding and following the coding regions as well as 
intervening sequences (Intro ns) between individual coding segments (exons). 

"Under transcriptional control" or "transcriptionaliy controlled" are terms well 
understood in the art that indicate that transcription of a polynucleotide sequence, usually a 
DNA sequence, depends on its being operebly linked to an element which contributes to 
« the initiation of. or promotes, transcription. 

"Under transnational control* Is a term well understood in the art that indicates a 
regulatory process that occurs after the messenger RNA has been formed. 

As used herein when describing proteins, and genes that encode them, the term for 
the gene is not capitalized and is Italics, i.e. ppa The term for the protein is generally not 
20 italicized and the first letter Is capitalized, i.e. Ppc. 

The terms "protein* and "polypeptide* are used interchangeability herein. The 3- 
letter code foramino acids as defined in conformity with Ihe IUPAC-IUB Joint Commission 
on Biochemical Nomenclature (JCBN) is used through out this disclosure. It is also 
understood that a polypeptide may be coded for by more than one nucleotide sequence 
2$ due to the degeneracy of the genetic code. 

.Encompassed-by the respective peptides are variants thereof in which there have 
been trMal^substttirtlons, deletions, insertions or other modifications of the native peptide 
polypeptide which substantially retain the respective peptide activity characteristics, 
. particularly silent or conservative substitutions. Silent nucleotide substitutions are changes 
so of one or more nucleotides which do not change any amino acid of the respective peptide. 
Conservative substitutions include substitutions within the following groups: glycine, 
alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine t glutamine; 
serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Such conservative 
substitutions are not expected to interfere with the respective biochemical activity, 
w particularly when they occur In structural regions (e.g., alpha helices or beta pleated 
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sheets) of the polypeptide, which can be predicted by standard computer analysis of the 
amino acid sequence of E. coll peptide. 

The terms "PTS', "PTS system" and "Phosphoenolpyruvate (PEP)-dependent 
phosphotransferase transport system" (E.C. 2.7.1.69) refer to the PEP dependent sugar 

s uptake system which transports and phosphorylates glucose Into a ceil. 

The terms "phenotyplcally PTS~7Glu~" f "PTS~7Glu~ phenotype" and "PTS*7Glu~ 
refer to a cell which has a significantly reduced ability to utilize glucose as a carbon source 
because the PTS, normally utilized by such strains to transport and phosphorylate glucose 
is inactivated. - . 

10 The terms "phenotypically PTS~7Glu + \ "PTS"7Glu* phenotype" and PTS"7Glu + refer 

to a cell that is capable of utilizing glucose as a carbon source despite the inactivatton of 
PTS. A PTS~/G1u* strain will have a specific growth rate of at least 0.4 per hour (0.4hr 1 ). 

The term "specific growth rate (p)" refers to the increase of mass or cell number per 
time. In one embodiment the specific growth rate will be at least 0.4hf \ at least O.Shf" 1 , at 

15 least 0.6hr"\ at least O-Zhr" 1 and at least 0.8hr 1 or greater. 

The phrase "enhanced production** refers to an Increase In the amount of desired 
end-product produced by a genetically engineered (altered) microorganism according to the 
invention as compared to the production of the same end-product in a corresponding wild- 
type (unaltered) microorganism- Enhanced production may be determined as an Increased 

20 amount of end product produced per unit of carbon substrate or per amount of 
microorganism. 

<■ - -By "enhanced growth rate"-is meant an increased growth rate of the altered 
bacterial host according to the invention as compared to the growth rate of the \ 
corresponding unaltered bacterial host 

zs By "aerobic conditions- is meant that there Is sufficient O2 available In the 

fermentation medium that the repressive regulation by arcA in an unaltered bacterial host 
cell is-initfated. In one embodiment Oa flow levels of greater than about 0.5 Mers per 
minute, greater than about 1.0 liters per minute, greater than about 2.0 liters per minute, 
greater than about 3.0 liters per minute, greater than about 4.0 liters per minute, and 

30 greater than about 5.0 liters per minute per 14 liter tank are contemplated. In another x 
embodiment not more than 15 Jrtars/min per 14 titer tank Is contemplated. Also between 
about 0.5 to 1 5 liters per minute and between about 0.5 to 12 liters per minute of oxygen 
flow per 14 liter tank are contemplated. 

By "blomass" is meant , when referring to a host ceil, the weight of the host cells in 

u the fermentation media. 
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The term "GUltaring* 1 means the fermentative biocortversion of a carbon substrate to 
a desired end product within a reaction vessel. Typical reaction vessels include but ere not 
limited to vats, bottles, flasks, bags, bioreactors and any other suitable receptacle. 

Bioconversion refers to the contacting a microorganism with a carbon substrate to 
$ convert the carbon substrate to a desired end product 

As used herein, the term "carbon source 11 encompasses suitable carbon substrates 
ordinarily used by microorganisms, such as 6 carbon sugars, Including but not limited to 
glucose (G), gulose, lactose, sorbose, fructose. idose, galactose and mannose all "m either 
D or L form, or a combination of 6 carbon sugars, such as glucose and fructose, and/or 6 
io carbon sugar acids. 

The terms "Isolating* and "isolation" mean to separate a compound from other 
materials. Isolation is meant to achieve a purity of at least 70%, 75%, 06%, 85%, 90%, 
95%. 97% end 99%. . 

The term 'Isolate'* with reference to a nucleic acid or polypeptide means the nucleic 
is add or polypeptide is removed from at least one component with which It Is naturally 
associated. , 

The term -introduced** In the context of inserting a nucleic acid sequence into a cell, 
means protoplast fusion, transfectlon, transformation, transduction, conjugation and the 
like. 

20 The term "transformed" means the cell has a non-native (heterologous) 

polynucleotide sequence integrated into is genome. 

• -The term "chromosomal integration" refers to a process where a polynucleotide is 
Introduced Into the chromosome of a cell 

As used herein, the term "bacteria" refers to any group of microscopic organisms 
as that are prokaryotic, i.e., that lack a membrane-bound nucleus and organelles. All bacteria 
are surrounded by a lipid-membrane that regulates the flow of materials in and out of the 
^eeni~Ar*igid*cell wail completely surrounds the bacterium and lies outside the membrane. 

A "genetically engineered microorganism'' refers to a microorganism having a 
modified genetic sequence as compared to the originalfinitial derivative and/or wild type. 
no An "altered bacterial host" according to the invention is a genetically engineered 

bacterial host cell having an Inactivated endogenous arcA orrpoS and an overexpressed 
ppc. In one embodiment, an altered bacterial host will have an enhanced level of 
production of a desired compound compared to the level of production of the same desired 
compound in a corresponding unaltered bacterial host grown under essentially the same 
3s growth conditions. 
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An "unaltered bacterial host ceir according to the invention Is a bacterial host cell 
wherein the endogenous arcA or endogenous rpoS gene Is not inactivated and remains 
functional. 

As used herein "chromosomal integration" te a process whereby an introduced 

* polynucleotide Is Incorporated Into a host celt chromosome. The process preferably takes 
place by homologous recombination. 

As used herein, "modifying* the level of protein or enzyme activity produced by a 
host cell refers to controlling the levels of protein or enzymatic activity produced during 
culturlng, such that the levels are increased or decreased as desired. 

10 As used herein, the term "modified" when referring to nucleic acid or a 

polynucleotide means that the nucleic acid has been altered in some way as compared to a 
wild type nucleic acid, such as by mutation in; substitution. Insertion, deletion of part or all 
of the nucleic acid; or by being operably linked to a transcriptional control region. 
Examples of mutations include but are not limited to point mutations, frame shift mutations 

is and deletions of part or all of a en?A gene or rpoS gene. 

Desired product" as used herein refers to the desired compound to which a carbon 
substrate is Woconverted into. Exemplary desired products are pyruvate, chorismate, PEP, 
glycerol, OAA, and succinate. * 

By "metabolic product* is meant any sugar, amino acid, protein, alcohol, add, 

ao organic compound, non-organic compound, or derivative or chemically modified version of - 
any of the above which Is produced as a result of a biosynthetic pathway. 

As used herein, trie-term "recombinant* refers to a host cell that has a modification 
of its genome, e.g., as by the addition of nucleic add not naturally occurring in the organism 
or by a modification of nucleic acid naturally occurring in the host cell. 

2s A polynucleotide or polypeptide having substantial homology to another 

polynucleotide or polypeptide wfll have a certain percentage (for example, 80%, 85%, 90%, 
95%, 96%-, 97% or 99%) ^"sequence identity^to another sequence, and means that, 
when aligned, that percentage of bases or amino acids are the same in comparing the two 
sequences. This alignment and the percent homology or sequence identity can be 

30 determined using software programs Known in the art, for example those described In 
Current protocols in molecular biology (F.m. Ausubei et ai.. eds., 1 987) 
Supplement 30, section 7.7.18. A preferred alignment program is ALIGN Plus (Scientific 
' .and Educational Software. Pennsylvania), preferably using default parameters, which are 
as.follows: mismatch = 2; open gap = 0; extend gap = 2. Another sequence software 

as program that could be used is the TFastA Data Searching Program available In the 

Sequence Analysis Software Package Version 6.0 (Genetic Computer Group, University of 
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Wlsconsin, Madison, Wl). One skilled In the art will recognize that sequences encompasses 
by the invention are also defined by the ability to hybridize under stringent or moderate 
conditions wrth the described sequences, 

A nucleic arid is "hybridteable" to another nucleic acid when a single stranded form 
6 of the nucleic ackJ can anneal to the other nucleic acid under appropriate conditions of 
temperature and solution Ionic strength. Hybridization and washing conditions are well 
known and exemplified in Sambrock (1989) supra, (see In particular chapters 9 and 11). 
Low stringency hybridization conditions correspond to a Tm of 55°C (for example 5 X SSC, 
0.1% SDS. 0.25 milk and no formamkie or 5 X SSC. 0.5% SDS and 30% forrnamtde). 
io Moderate stringency hybridization conditions correspond for example, to 6 X SSC, 0.1% 
SDS, 0.05% milk with or without formamlde, and stringent hybridization conditions 
correspond for example, to a Tm of 65°C and 0.1X SSC and 0.1% SDS. 

As used herein, the term -comprising - and its cognates are used in their inclusive 
sense; that is, equivalent to the term including" and Its corresponding cognates, 
is *A, W "an" and 1he n include plural references unless the context clearly dictates 

otherwise. 

*NCBI" refers to the National Center for Biotechnology Information. Natl Library 
Med. (www.ncbf.nlm.nih.govy). 

"ATCC" refers to the American Type Culture Collection international depository 
zq located at 1 0801 University Blvd. Manassas. VA 201 110. 

Preferred embodiments. ■ 

It would be highly advantageous to metabolicaJly engineer bacteria to regulate the 
flow of compounds through various biochemical pathways resulting in an increase in the 
as production of desired compounds, an increase in growth rate of engineered cells or an 
Increase in biomass production of the engineered cells. 

- The present invention encompasses the-lnectivation of an arcA gene and the RNA 
encoded thereby and/or the inactivation of a rpoS gene, said inactivation inhibiting the 
ability of the gene to down regulate expression of metabolic products. 

30 

L H9?t ceils- 

Preferred host cell* that may be genetically engineered according to the invention 
are bacterial ceJ]s, In particular bacterial cells of Salmonella, Citrobactert Pseudomopas, 
Ralston}a y strains within the family of Errtorobacteriaceae and eubacterla. 
« The family "Enterabacteriaceae n refers to bacterial strains having the general 

characteristics of being gram negative and being facultatively anaerobic. Included in the 
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family of Enterobnctenac&ao are ErwinJa, Enterobacter, Gtuconobacter, Serratia, Klebsiella, 
Escherichia and Pantoea. In the present Invention, particularly preferred 
EntervbactQriaceae are Erwinia, Klebsiella, Escherichia and Pantoea, and most preferred 
are fermentation strains of Escherichia and Pantoea. Reference Is also made to Kageyarna 
s et al., (1992) international J. Sys. Bacterid. 42:203. In one embodiment, the preferred host 
cell Is a Pantcea celL Pantoea includes R agglomarans, P. dispersa, P. punctata, P. citrea, 
P. terrea, P. ananas and P, sfewarWand In particular, Pantoea citrea. Pantoea cftrea Is a 
preferred Pantoea strain, for example ATCC Mo. 39140. Pantoea cftrea has sometimes 
been referred to as Erwmia cfavvs or Acetobacter cerinus. Thus, It Is intended mat the 
10 genus Pantoea include species that have been reclassified, including but not limited to 
Erwinia cfoaus or /Icefo&acter cerinus. In another preferred embodiment, the host strain is 
an E. coll cell, for example KLP23 as disclosed in WO 01/O12833A2; RJSN having ATCC 
- No. PTA 4216 and MG1655 having ATCC No. 700926. 

In another embodiment the host strain is one that Is capable of using a PTS for ■ 
ift carbohydrate transport, A general review of the PTS can be found in (Postma et al., 1 993, 
MtcrohfoL Rev- 57:643-694; Romano et al., 1979, J. Bacterid. 13*93^97 and Saler et al. 
1990, In; Bacterial ENERGETICS pp. 273-299, T.A. Krulwlcru Ed. Academic Press, NY)* 
Also reference Is made to Meadow et al. (1990) Anna Rev. Biochem. 59:497-542 

In one preferred embodiment the host strain encompassed by the Invention is a 
29 strain that was originally capable of utilizing a PTS for carbohydrate transport but has been 
©Herod to have an inactive PTS {prvenotypically PTS"7G!u~ ). 

Selection of PTS^-cells can be achieved using techniques available to those skilled 
In the art fnactivation wifl effectively reduce PEP phosphorylation to undetectable levels as 
measured by PEP-dependent phosphorylation of 2-deoxy-D-glucose using the protocols 
25 described by Gachelin, G. (1969). Biochem. Bhphys. Acta. 34:382-387: Romero, et al. t 
. .(1979) J. Bact 139:93-97; or-Bouvet and Grimont. (1987) Ann.-lnst. Pasteur/Miccobiol. 
-438:3-13. Also PEP phosphorylation assays are useful in determining the level of PTS~ 
expression. 

PST7Glu~ host cells may be selected from PTS wild-type host cells by Inactivation 
so of at least one gene encoding part or all of tha enzymes comprising the PTS. By way of 
example, in one embodiment, the PTS Is Inactivated by the deletion of at least one gene 
selected from the group consisting of pt$\, pteH and c/rencoding the El, HPr and 1IA G1C 
proteins of the PTS respectively (Postma, et al (1 993) MiorobioL Rev. 57:543-594). in other 
embodiments, at least two of the genes are inactivated. The nucleotide sequences of pta\ y 
35 ptsH and err have been determined (Saffen et al., (1987) J. BioL Chem. 262:16241-16253; 
Fox et al., (1984) Biochem. Soc. Trans. 12:155-167; Weigel et al.. (1982) J. BioL Chem. 
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257:14461-14469 and DeReuse et al.. (1988) J. Bacterial. 176:3827-3837). in other 
embodiments, the inactivation of all three genes ptei. pteH and err by deletion will 
effectively reduced PEP phosphorylation to undetectable levels. 

A nonllmWng example of a method employed to Inactivate the PTS is as follows. It Is 

a known that in E. caff the ptel, ptsH and err are linked together in an operon- The ptsHlcrr 
operon in E. co// strains such as JM101 (Yanisch-Perron et al. (1985) Gene 33:103 - 119) 
and strain PB1 03 (Mascarenhas (1987) PCT WO/87/01 1 30) can be inactivated by deletion 
using a generalized transduction method as described by Silhavy, et al, (1984) lm 
Experiments WITH Gene Fusions, pp 1 10-1 12, Cold Spring Harbor Laboratory Press, NY. 

i« P1vir phage is used to perform the transduction and strain TP281 1 (Levy et al., (1 990) 
Gene 86:27 - 33) can be used es the donor of the ptsHlcrr deletion. The process can be 
carried out in two stages. First, e celWree suspension of phage is prepared by growing 
bacteriophage P1vlr on strain TP281 1. In the TP281 1 strain most of the ptsHlcrr operon 
has been deleted and a kanamychn -resistant marker may be Inserted in the same DMA 

« region (Levy et al, (1 990) Gene 86:27-33). TtiB obtained P1vlr lysate was able to 
transduce the ptsHlcrr deletion and kanamydn resistance marker simultaneously. 
Secondly, these phage can be used to infect the recipient strains, JM101 or PB103 and 
transductants may be selected by plating the infected cells on MacCon key-glucose plates 
containing kanamydn. The recipient strains (JM101 and PB103) are kanamydn sensitive 

h and form red colonies on MacConkey-giucose plates. The MacConkey-glucose plates 

contain an indicator dye that, depending on the pH, can vary from white to deep red- rf the 
-cells can transport glucose at a fast rate, normally they will secrete organic acids and 
produce red -colonies. If glucose transport is diminished or absent the cells will not produce 
organic acids end the colonies will be white. This enables one to ascertain whether the 

25 * host cell exhibits a glucose* (PTS"7Glu+) or glucose"" (PTS~/Glu~^) phenotype. 

To corroborate If transductants have a PTS"7Glu~ phenotyp© they can be selected 
and inoculated in minimal medium .containing glucose as the only carbon source. One 
would expect after incubatfon (for 12 hours at 37*C) the transductants would have no 
detectable cell growth and the PTS parent strains would grow very well (WO 96/34961). 

» Based on the above results of WO 96/34961 the PTS" derivative of JM101 was designated 
PB1 1 and the PTS" derivative of PB103 was designated NF6. 

Another test for the absence of the PTS system is based on the feet that PTS" 
strains become resistant to the antibiotic fosfomycin Cordaro et aL. (1976) J. Bacteriot 
128:785-793. 

*s One further nonllmitlng method which may be used to inactivate PTS in a bacterial 

host includes inserting or modifying a repressor binding region operably finked with a gene 
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encoding an expressed protein such that the expression of the gene occurs in the presence 
of a specific molecule. For example, the tec operator is a DNA sequence recognized by the 
Lac repressor. If this operator Is located in the proper position, relative to the promoter, 
then if the repressor is bound to the operator, h will Interfere with the binding of the RNA 

5 polymerase and block gene transcription. This blockage can be removed by the addition of 
the inducer IPTG (isopropyl-p-D-thiogaJactoside). The level of repression and/or induction 
will depend on the strength of the promoter* the location and sequence of the operator, as 
well as the amount of the repressor and the inducer (Muller J. et a!., (1996) J. Mo/. Broi. 
257:21-29). The tec operator Is used to regulate a number of promoters, among them 

10 several variants of the lac promoter and the hybrid tru promoter. 

Another nonlimibng method to affect a FTS~7Glu~ phenotype includes the 
incorporation of a promoter which affects the expression of the structural gene sequence 
when certain conditions are present For example, the Pm promoter from the TOL plasmid 
can be used to control gene expression. In this system, gene expression is achieved when 

is benzoate or toluate Is added to the media. (Mermod et at., (1986) J. Bad. 167:447*464). 
Still a further nonllmWng method to affect a PTS - phenotype is to alter the mRNA stability 
as described by Carrier and Keasllng (1 997) BiotechnoL Prog. 1 3:699-708. 

To Increase or redirect carbon flow to desired metabolic pathways In inactivated 
PTS host cells, glucose transport and phosphorylation must be deregulated or amplified, 

20 Therefore in one preferred embodiment a preferred host cell wiH have a restored glucose* 
(PTS~7Glu*) phenotype; that is a PTS~/Glu~ host cell is modified to restore the glucose* 
phenotype, thereby obtaining a PTS"7Glu- phenotype. WO 96/34961 and Hernandez- 
Montalvo et aL (2001) Appi Microbiol Btotechno! 57:1 86-191 described E. coif strains 
having a PTS"7Glu+ phenotype. 

25 

2. inactivatlon of global regulatory genes. 

a. Genes to be Inactivated . - - -- - ■ 

To achieve beneficial increased production levels of desired products, an increased 

growth rate or an increased bioma&s of a host ceil, the inventors developed a method for 
30 Identifying beneficial modifications comprising the step of selecting global regulators that 

have a regulatory effect on a multitude of genes that express polypeptides having beneficial 

enzymatic activities and modifying the global regulatory genes. Exemplary global regulators 

include, but are not limited to arcA, rpoS, Fnr <NP 415850), OxyR (NP 418396), CRP (IMP 

417816), Fur (NP 414622) and H-NS (PQ8936). 
ss In a preferred embodiment the modified global regulatory gene is an a/cA gene and 

homologous genes coding for RNAs or proteins having essentially the same function and 
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having at least 70%. 75%, 80%, 85%. 90%, 93%, 96%, 97% and 99% sequence identity 
thereto. The arcA gene has been characterized (NCBl AE000510). It has also been 
identified and characterized in Samoneila iyphimurium LT2 (NC 003197), Samoneila 
enterica (NC 003098) and Stretomyces cool/color (NC 003888), The ArcA sequence from 
s E. coll has been deposited in the NCBl database with the number Np 41 8818. (Also 

reference Is made to GenBank Accession No. U00096). The ArcB sequence from £ cofi 
has been deposited in the NCBl database with the number NP 417677. (lucht et a!.. (1990) 
Mol. Microbiol 4:715-727). Therefore in one embodiment a polypeptide having ArcA activity 
will have at least 70%, et least 80%, at least 85%, at least 90%, at least 96% and at least 
to 98% sequence homology to the E coii ArcA having the sequence of NP 418818. The 
arcA/arcB regulatory system regulates more than one operon which is involved In 
respiratory and fermentative metabolism in response to oxygen deficiency. The reference 
Lynch et a(., {(1996) Regulation OF Gene Expresshdn in ESCHERICHIA COU Eds. Lin et al.. 
Chapman and Hall NY) list numerous genes that may be regulated by the ArcA/ArcB 
is system. Some of these genes include aceB, acn. QtcA, cob, oyd (A and B), cyo (A, B, C, D 
and E), fedB, Wn(G, H and I), /ocA, pfl, fuwK glpO, gttA, hemA, /iya(A-F). icd, id (P, R and 
D), mcfh, m/o(A-N), pdh, aee(E and F), /pd, pdu, pocR, sdh (A -D). soda, suc{A-V) and <raY; 
and particularly. aceB, arcA, cyo* (A and B), cyo (A B. C. D and E), oflA, mdh, flUOfA-N), 
sdh (A -D) and si/c(A-D). 
20 In another preferred embodiment the modified global regulatory gene is rpoS and 

homologous genes coding for RNAs or protein having. essentially the same function and 
having at least 70%. 75%, 80%. 86%. 90%, 93%, 96%, 97% and 99% sequence identity 
thereto. The rpoS gene has been characterized and the /poS sequence from E.coti has 
been deposited in the NCBl database with the number NP 417221 . Therefore in one 
as embodiment a polypeptide having RpoS activity will have at least 70%, at least 80%. at 
least 85%. at least 90%, at least 95% and at least 98% sequence homology to the E. coti 
-RpoS having the sequence of NP 4*7221-^e4SNA-p0lymerase sigma factor which is the 
product of the rpoS gene binds to the core RNA polymerase under a variety of stress 
factors. While the number and types of genes controlled regulated by rpoS will vary, the 
3, reference of Hengge-Aronis ((1996) In ESCHERICHIA con and Salmonella: Cellular and 
Molecular BIOLOGY, Eds. Neidhardt et aL, ASM Press, Washington DC pages 1492-1512) 
list numerous genes that may be regulated by rpoS. Some of these genes Include a/dB, 
a/dB, app(A r B.C. y), oo/A, dbdAB, cbpK cfa t csg(A, B, D, E, F and G). cs/E. dacC, dps, 
6fnr{A and B), fic r fts{A t Q and 2), geJ(E, K and T) t glg(A and S), gipD. gpr, hde(A and B), 
3s hfm{A and D), MrE, hya{K B. C. D. E and F), kat(B and G). /ecZ. WcC, mcc, o$m(B andY). 
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ofs(A ami B), poxB, p/o(P, V, W, and X), fopA, fnsA, wrbK and xtf>A; and particularly, a/cB. 
dps and poxB. 

The genes of homoiogs of E. co// a/tA and rpoS In other species or strains can be 
obtained by generating cDNA from RNAfram such species using any technique known In 

$ the art, such as using Rlboclone cDNA Synthesis Systems AMV RT (Promega, Madison, 
Wis.)> then probing such cDNA with radiolabeled primers containing various portions (e.g. 
30 or 40 bases long) of the sequences of the E.coli arcA or rpoS disclosed herein and in 
references cited herein. To obtain homoiogs of arcA and rpoS degenerate primers can 
encode the amino add sequence of the disclosed E coll polypeptides but differ in cod on 

to usage from the sequences disclosed. Additional known hybridization techniques and 
computer-automated sequence comparisons and identification using algorithms such as 
BLAST may be used to identify homologous sequences. 

At least one host cell chromosomal region can be modified by mactivation of the 
region coding for the expression of a global regulator a/cA or the global regulator rpoS and 

is those homologous thereto. In activation may occur through the deletion, insertion or 

substitution of at least one nucleotide in the genomic coding region of the wUd type host 
cell. In a preferred embodiment inactivatlon of one or more genes will preferably be a 
stable and non-reverting inactivafion. In a further embodiment, inactjvation results from 
deletion of part or all of the ancA or rpoS gene and those homologous thereto. The deletion 

20 may be partial as long as the sequences left in the chromosome are too short for biological 
activity of the gene. In some embodiments an altered bacterial cell wM Include more than 
one inactivated gene, for examples at least two inactivated genes, at least three Inactivated 
genes, at least four inactivated genes, at least five inactivated genes, at least six 
inactivated genes or more. The inactivated genes may be contiguous to one another or 

25 may be located In separate regions of the host cell chromosome. The inactivated gene may 
have a necessary function under certain conditions but the gene is not necessary for host 
. .celUstreln viability under laboratory conditions such as growth in a fermentation. In a shake 
flask, in plate media and the like. 

30 b. Construction of DNA integration ca^ettes for gene Inactivatlon 

Typically DNA Integration cassettes (also referred to as DNA mutagenic cassettes 
or constructs) which are useful for modifying endogenous chromosomal arcA, rpoS and 
homologous regulatory regions thereto include homologous nucleic acid sequences of the 
regulatory region encoding the expression of the global regulators to be inactivated, a 

35 selectable marker, and sequences for allowing autonomous replication or chromosomal 
integration, such as recomblnase recognition sites. 
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The nucleic acid sequences homologous to upstream (5*) regions of a gene 
encoding a global regulatory protein. These homologous sequences will preferably flank a 
first recomblnase recognition site (5' thereto) and a second recomblnase site (3' thereto). 
Nucleic acid sequences homologous to upstream (5') regions of a gene encoding a global 

5 regulatory protein include sequences derived from a) a sequence 5" to the endogenous 
regulatory region that is targeted for modification, and b) a sequence 3 T of the endogenous 
regulatory region that Is targeted for modification. The 3* sequence may Include parts of a 
global regulatory protein coding sequence. A homologous flanking sequence may include 
from about 2 to 500 bp, about 5 to 250 bp f about 5 to 200 bp, about 5 to i 00 bp and about 

io 5 to 50 bp. 

A mutagenic DNA cassette encompassed by the invention wiH include a selectable 
marker and a number of genes can be used to detect Insertion of the gene In E. co/A Some 
of these genes confer selectable phenotypes. In this case, media conditions can be 
established so that only colonies which have expression of these genes activated will grow. . 
is Other genes confer phenotypes which can be screened. A screenable phenotype can often - 
yield Information about levels of gene expression. While any desired marker can be used, 
based on these properties, useful antibiotic resistance (Anb*) markers include but are not 
limited to, Cm R , Km R and Gm R A preferred norHtmiting example of a selectable marker Is 
a chloramphenicol acetyltransfcrase (CAT) gene. In a preferred embodiment, the 
20 selectable marker will be flanked on both sides by a recombinase recognition site. It is 
advantageous if the selective marker is removed after integration into the target cell. 

Recombinase sites are well known in the art and generally fall into two distinct 
families based on their mechanism of catalysis and reference is made to Huang at aL, 
(1991 ) NAR. 19:443; Datsenko and Warner (2000) Proa Natl. Acad. Scf 97:6640-6645 and 
25 " Nunes-Duby, D, et al F (1998) NAR 26:391-406, Preferably the recognition sites are the 
same- One well known recombination system is the Sacchatvmyces Flp/FRT 
.recombination system-whlch comprises a Ftp enzyme and^two asymmetric 34 bp FRT 
minimum recombination sites (Zhu et al. v (1 995) J, Chem 270:11646-1 1653). A FRT 
site comprises two 13 bp sequence inverted and Imperfectly repeated, which surround an 8 
30 bp core asymmetric sequence where crossing-over occurs. (Huffman et al-, (1999) J. Moi 
BfoL 286:1 - 13). Another well known recomblnase system is the Cre/loxP site-specific 
recombination system of bacteriophage P1, which comprises a Cre enzyme and two 
asymmetric 34 bp loxP recombination sites (Sternberg and Hamilton (1981) J. MoL BhL 
150:467-466); Palmares. B. et al (2000) Gena 247:255-264; Hoess et aL (1986) NAR 
m 14:2287-2300; Sauer B. (1 994) Curr. Opinions in Bhtechnoi 5:521 -527). A loxP site 
comprises two 13 bp sequences, inverted and imperfectly repeated, which surround an 8 
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bp core asymmetric sequence, where crossing-over occurs. The Cre-depandent 
intramolecular recombination between two parallel JoxP sites results is excision of any 
intervening DNA sequence as a circular molecule, producing two recombination products, 
each containing one faxP site (Kilby et al., <1 993) Trends Genet 9:41*421 ) r 

5 

C. Transformation 

Once a mutagenic DNA integration cassette Is constructed It may be introduced into 
a host cell by means wen known in the art Some of these transfer techniques include 
transformation, transduction, conjugation and protoplast fusion. A variety of transformation 

io procedures are known by those of skill in the art for deleting or overexpresslng genes or 
gene fragments in the host cell chromosomes. General transformation procedures are 
taught in Current Protocols in MOLECULAR biology (vol. 1 , edited by Ausubel et a!;. 
John Wiley & Sons. Inc. 1987, Chapter 9) and include calcium phosphate methods, 
transformation using DEAE-Dextran and electroporatlon. Reference is also made to USP 

is 5,032.514; Potter H. {1 988) Anal BiOChem 174:361-373; and SambrooK , J, et at, 
Molecuiar Cloning: A Laboratory MANUAL. Cold Spring Harbor Laboratory Press 
(1989). 

Whether a host cell has been transformed can be detected by the 
presence/absence of marker gene expression which can suggest whether the nucleic acid 
20 of interest is present. However, its expression should be confirmed. For example, if the 
nucleic acid encoding a pathway enzyme Is Inserted within a marker gene sequence, 
-recombinant cells containing- the insert can be Identified by the absence of marker gene 
function. 

Expression of the marker gene in response to induction or selection usually 
2s ' indicates expression of the enzyme as welL Alternatively, host cells which contain the 
. coding sequence for a pathway enzyme end express the enzyme may be identified by a 
A/ariety of procedures knowa-te4hose of skill In the art - These procedures include, but are 
not limited to, DNA-DNA or DNA-RNA hybridization and protein bioassay or immunoassay 
techniques which include membrane-based, solution-based, or chip-based technologies for 
■ 30 the detection and/or quarrtificatlon of the nucleic acid or protein. Additionally, the presence 
of the enzyme polynucleotide sequence in a host microorganism can be detected by DNA- 
DNA or DNA-RNA hybridization or amplification using probes, portions or fragments of the 
enzyme polynucleotide sequences. 

In a preferred embodiment once the mutagenic DNA cassette is introduced into a 
35 host cell an endogenous global regulatory gene is inactivated, and preferably an arcA 
gene, a rpoS gene or homologous genes thereto Is inactivated by deletion and preferably 
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deleted by homologous recombination. This general methodology is described in Datsenko 
and Werner (2000) proc. Natl. Acad. Sd. 97:664f>6645). 

Alternatively when the aroA gene is to be deleted, a chloramphenicol resistance 
gene may cloned Into a unique restriction site found in the arcA gene. The Cm R gene may 

5 then be inserted Into the structural coding region of the gene at the respective site. 
Modification is then transferred to the chromosome of a Bcoti arcA by homologous 
recombination using a non-replicating R6K vector. The Cm R gene may then be removed 
from the arcA coding region leaving an Interrupting spacer in the coding region, which 
results in inactivation of the coding region. In another embodiment, the Cm* gene may be 

ift inserted into the codTng region in exchange for portions of the coding region. Subsequent 
removal of the Cm R gene without concomitant reinsertion of the exchanged out portion of 
the coding region results In an effective deletion of a portion of the coding region, 
inactivating such region. The end result is that the deleted gene Is effectively non- 
functional. 

U in another embodiment mactivation Is by Insertion. For example a DMA construct 

will comprise a nucleic acid sequence having the arcA gene interrupted by a selective 
marker. The selective marker will be flanked on each side by sections of the arcA gene 
coding sequence. The DNA construct aHgns with essentially Identical sequences of the 
endogenous arcA In the host chromosome and in a double crossover event the 

30 endogenous arcA gene Is Inactivated by the insertion of the selective marker. 

In another embodiment, inactivation is by insertion in a single crossover event with a 
piasmld as the vector. For example, the aroA gene to be excised fs aligned with a plasmid 
comprising polynucleotides of the arcA gene or part of the gene coding sequence and a 
selective marker. The selective marker may be located within the gene coding sequence or 

2$ on a part of the plasmid separate from' the gene. The vector is integrated into the host cell 
.chromosome, and the gene Is Inactivated by the insertion of the vector in the coding 

sequence. - ■ - - - - . _ 

Inactivation may also occur by a mutation of the gene. Methods of mutating genes 
.are well known in the art and Include but are not limited to chemical mutagenesis, stte- 

so directed mutation, generation of random mutations, and gapped-duplex approaches. ( USP 
4,760,025; Moring et al.. Biotech, 2:646 (1964); and Kramer et al., Nucleic Acids Res. 
12:9441(1984)). 

Inactivation may also occur by applying the above described inactivation methods to 
the respective promoter regions of the desired genomic region. 
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P. Qo ^ruction of DNA cassettes for overexpress'ion of ppc 

The preset invention encompasses overexpresslng a ppc gene. The ppc gene and 
homologous genes thereto can be overexpressed by the introduction of a DNA cassette 
comprising of multiple copies of the ppc gene or by the introduction of a DNA cassette 

s comprising a regulatory sequence including a promoter operably linked with a ppc. 

In one preferred embodiment overexpresskin Is accomplished by replacing an 
endogenous regulatory region of a chromosomal ppo gene with a exogenous promoter. As 
used herein with respect to ppc, an exogenous promoter is a promoter other than a 
naturally occurring promoter, which Is opera bry linked to en endogenous coding region of a 

10 phosphoenolpyruvate carboxylase (Ppc) in a host cell end Includes but is not limited to non- 
native promoters, synthetic promoters, and modified naturally occurring promoters which 
include a) native endogenous promoters which are operably linked to a polynucleotide 
encoding a Ppc. wherein the native promoter has been altered end then reintroduced into 
the host cell chromosome and b) native endogenous promoters which are not operably 

is linked to a polynucleotide encoding a Ppc, 

In a preferred embodiment a DNA cassette is constructed which comprises 
regulatory sequences including an exogenous promoter as defined above and further 
Includes a selectable marker; sequences allowing autonomous replication or chromosomal 
integration, such as recombinase recognition sites; and flanking sequences, which are 

20 located upstream (5') and downstream (3'} of the recombinase recognition sites. 

A regulatory region and specifically including a promoter useful In a DNA cassette 
according to the invention includes*$equences of between about 20 to 200 bp, of between 
about 20 to 150 bp and of between about 20 to 100 bp. Exemplary exogenous promoters 
indude glucose jsomerase (Gl) (NCBI NC 003074), tro. Tac, Trp, Lac, LacUN/5, BJa, 

25 Pft, Pfa* Pai, Pas. Pas and F\ and derivative promoters thereof (Deuschle et el., (1936) 
{BMBO J. 5: 2987-2994; (Amann et al., (1983) Gene 25:167-178 and Amore et a!. (1989) 
AppL h/Ucrobloh BiotechnoL 30:351-357)^Preferred promoters Include the tfC promoter and 
derivatives thereof (Amann et al., (1983) supra) and the <5I promoter (also known as a 
xylose isomerase promoter), short Gl promoters and derivatives thereof. Reference is 

30 made to Amore et al. (1 989) supra. 30:351-357. The sequence of a segment of the Gl 
promoter (+50 to -7 of the -1 0 box) Is set forth in S£Q ID NO. 19 
5' CGAGCCGTCAaSCCC TTGACAA TGCCACATCCTGAGCAMTAAT 3' 
wherein the -35 box is represented by TTGACA and the -10 box is represented by 
AATAAT. Additionally, a preferred promoter Is the short 1.6GI promoter as illustrated in 

as SEQ ID NO, 20. 



PACE 33/72 ■ RCVD AT 7/8/2004 2:51:58 PM [Eastern Daylight Time] " SVR:USPTO-EFXRF-1/5 * DNIS:8729306 * CSID:202 457 0107 ■ DURATION (mm-ss):29-32 



JUL-0B-2004 15: 15 RJINOMOTO CORP SERU LLC 202 457 0107 P. 34/72 



WO 2<HM/033421 PCT/liS2003/4)3144S 

-27- 

A derivative promoter may Include a modification to at least one nucleotide In a 
position corresponding to a nucleotide in the -35 box. Hnker region or -10 box. In a 
preferred embodiment these derivative promoters are altered In a position corresponding to 
a position In the -35 box. Particularly preferred derivative promoters Include a modification 

s to a -35 box corresponding to TTGACA and TTTACA. Some TTGACA modifications include 
TTGAAA, TTCAC and CTGACA, One particular modification is to the position 
corresponding to position -35. Particularly preferred derivative promoters also Include a 
modification to a -10 box corresponding to TATAAT, TAAGAT and TATQTT. Linker regions 
may also be modified (Burr et ah, (2000) NAR 28:1 864-1870). Preferably linker regions • 

io whether modified or unmodified are between 14 to 20 bp in length, preferably 1 6 bp, 17 bp, 
18 bp and 19 bp. Those skilled in the art are well aware of methods used to make 
modifications in promoters and the present invention is not limited by these methods. One 
exemplary method includes use of the Quikchange Kit (Stratagene); Reference Is also 
. made to WO 98/07846; Russell and Bennett (1982) Gene 231-243 and Sommer et ah 

is (2000) Microbiol. 1 46:2643-2653. 

A selectable marker can be used to detect Insertion of the ppc gene in a host cell as 
previously described . 

in a preferred embodiment a DNA cassette comprising the promoter to be 
integrated into a host cell chromosome at a ppc target site will include a selectable marker 

20 flanked on both sides by e recomblnase recognition site. Recomblnase sites are described 
above for the DNA cassettes useful for toactivatton of arcA and rpoS. Also see Huang et 
■ al, (1991J NAR. 19:443; Datsenko and Warner (2000) Proc; NaU. Acad. Sci 97:6640-6645; 
Nunes-Duby, D f et al, (1998) NAR 26:331-406; Zhu et al„ (1995) J. BfoL Chem 270:1 1646- 
11653; Huffman et al-. (1999) J, Mol, BfoL 286:1 - 13; Sternberg and Hamilton (1981) J. 

« Mol Biol. 150:467-486); Palmeros, B, et al (2000) Gepe 247:255-264; Hoess et aL (1986) 
. NAR 14:2287-2300; and Sauer B. (1994) Cvrr. Opinions in Biolechnof. 5:521-527).) 
— - - One preferred recomblnase system is the Cre/loxP site-specific recombination 
system of bacteriophage P1» which comprises a Cre enzyme and two asymmetric 34 bp 
JoxP recombination sites (Sternberg and Hamilton (1981) J. MoL Biol 150:467-486); 

30 Palmeros, B. et al (2000) Gene 247:255-264; Hoess et al. (1 986) NAR 14:2287-2300; 
Sauer B. (1994) Curr. Opinions in Bfotechnol. 5:521-527 and Wlby et aL. (1993) Trends 
Genet. 9:414-421). Preferably the recognition sites are the same. 

An integration DNA cassette useful for effecting overexpression according to the 
invention will also include nucleic acid sequences homologous to upstream (5*) regions of a 

as gene encoding a Ppc protein. These homologous sequences will preferably flank the first 
recombinase recognition site (5* thereto) and the promoter <3 4 thereto). Nucleic acid 
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sequences homologous to upstream (5*) regions of a gene encoding a Ppc protein Include 
sequences derived from a) a sequence 5' to the endogenous regulatory region that Is 
targeted for modification, and b) a sequence 3' of the endogenous regulatory region that Is 
targeted for modification. The 3' sequence may include parts of a Ppc coding sequence. A 
s homologous flanking sequence may Include from about 2 to 260 bp, about 5 to 200 bp. 
about 5 to 1 50 bp, about 5 to 100 bp and about 5 to 50 bp: 

The E. goU sequence of Ppc has been deposited in the NCBI database as NP 
418391. Methods of obtaining a desired gene from bacterial cells are common and well 
known in the art of molecular biology. For example, if a sequence of a gene is known, 
10 suitable genomic libraries may be created and screened. Once a sequence is isolated the 
DNA may be amplified using standard techniques such as polymerase chain reaction 
(PCR) (USP 4,683,202) to obtain amounts of DNA by restriction. Also reference is made to 
Sambrook et aL. supra > Additionally publicly available computer programs can be used to 
determine sequences with identity to a Ppc. Preferred programs include the GCG Pileup 
is program, FASTA (Pearson et aL (1988) Proa fVatt Acad. SoL USA 85:2444-2448) and 

BLAST (BU\$T Manuel, Attschul et aL. Natl. Cent Bkrtechnol. Inf., Nati Ubrary Med. (NCBI 
NLM), NIH, Beihesda MD and Attschul et aL. (1997) NAR 25:3389-3402). 

Once a DNA integration cassette for ppc overexpresslon fe constructed it may be' 
introduced Into a host cell as described above for the DNA mutagenic cassette. The 
2D engineered cell preferably expresses ppo at a level higher than the level Of ppc expressed 
in a comparable wild-type cell. This comparison can be made In any number of ways by 
-one of skill in the art and is done under comparable-growth condrBons. For example, Ppc 
enzymatic activity can be quantified and compared using various assays known to those 
skilled in the art For example, Ppc activity can be quantified using cell free extracts by a 
as coupled assay (Ftores and Gancedo 1997). This method involves incubating at room 
temperature a ultracentifuged (50,000 rpm, 1 h, 4*C) cell free extract sample in a cuvette 
-that contain 0.22 rnM NADH, 1-1 mM phosphoenolpyruvate (PEP). 0.25 mM aeetyl-CoA, 
and 6 U of malate dehydrogenase (MDH) in 0,1 M Tris/HCl, pH 8-5 buffer with 1 1 mM 
sodium bicarbonate end 11 mM magnesium sulfate, in a total volume of 1 .0 mL A 
so background rate of the reaction of enzyme and NADH ie first determined at 340 nrn. The 
second substrate, PEP, is subsequently added and the absorbance change over time is 
•further monitored. PPC activity is defined by subtracting the background rate from the 
gross rate. 



/ 
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g , inaHrvatlon of other oenes 

Methods as described above may be used to inactivate other genes, and 
engineered bacterial strains according to the invention may be further modified to Include 
these Inactivated genes. 

s 

lnftrilvatonof add 

Catabollsm of gluconate via the Entn^M^oudoroff pathway Is controlled by the GntR 
regulon. It has been identified as being located at 1931.7 B.cotl (Conway t et aL (1991) J. 
Bacterid., 173:5247-5248 and Rgan et aL (1992) J, Bacterid. 174:4538-4646). Gluconate 

id Is the only carbon source known to require the eofcf gene product for efficient metabolism, 
(Sweeney et aL. (1996) Infect. Immun. 64:3497-3503). The edd gene (NCBI AAC74921) 
encodes 6-phosphogluoonate dehydrase (EC 4,2.1 .12) which converts gluconates 
phosphate into pyruvate and glyceraJdehydes-3-phosphate in the cafabolism of gluconate. 
(Peekhaus & Conway. (1998) J. Bart , 1 80: 3495-3502). It has been observed that the 

16 inactivatlon of edd does not preclude growth on gluconate and that gluconate metabolism 
wilt occur through the pentose pathway. The participation of the Entner-Doudoroff pathway 
in cells growing on glucose as the only carbon source has not been reported. However, 
considering that under those conditions, the majority of the carbon Is assimilated through 
the glycolytic pathway and only a small percentage of the carbon goes to the pentose 

20 pathway, the participation of the Entner-Doudoroff pathway in glucose assimilation (if any), 
has been perceived to be minimal. Glucose metabolism through the Errtner-Doudoroff 
pathway is less efficient In-the production of energy because by forming directly one 
molecule of pyruvate and one molecule of glyceraldehydes-3-phosphate, skips the 
formation of other intermediates like Fructo$e 1 t 6 biphosphate and Fructose 6-phosphate, 

2s • forcing the cells to synthesize these intermediates through gluconeogenic reactions that are 
more costly from the energetic point of view. Therefore in one embodiment it could an 
advantagecus-to further engineer host cells to have a nonfunctional edd. 

The absence of the phosphogluconate dehydratase catalyzed end products would 
indicate the inactivation of the gene. For example, phosphogluconate dehydratase 

30 enzymatic activity can be quantified and compared using various assays known to those 
skilled in the art For example. Edd activity can be quantified using HPLC methods (Taha 
et aL, (1994) >toaJ(Ktfca/£^ 

inactivatlon of ota and ackA 
35 In many bacteria that are metabolizing glucose at a high rate under aerobic 

conditions, the carbon flow exceeding the capacity of the TCA cycle is converted to acetic 
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add for secretion outside of the cells. In Ecotf , acetate is formed from Acetyl-CoA by the 
action of 2 enzymes: phosphotransferase (E.C. 2.3,1.8) end acetate kinase (B.C. 
2.7.2.1), encoded by the pte(NCBl entry AE000319) and ackA (NCBI entry AE000318) 
genes respectively. It has been reported that the inactivation of the pta and ackA genes 

s does not eliminate the formation of acetate completely. (Ricci et aL. (1991) BiotechnoL 
Bloeng. 38: 1318-1324). Furthermore, the Inactivation of these genes caused the secretion 
of other organic acids like lactate and pyruvate, and in general, it had a negative efTect on 
the metabolism of B.coli under both aerobic and anaerobic culture conditions (Diaz-Ricd et 
aL, (1991) fifofsc/irto/. Bioeng. 39: 1318-1324). However, the elimination of the acetate 

io pathway could have a beneficial impact on a fermentation process. 

Phosphotransacetylese enzymatic activity can be quantified and compared using 
various assays known to those skilled In the art. (De Spiegeleer, Bart, et al, Anal. Biochem. 
(1986), 158(1 ), 1 35-200). 

15 Inactivation gfjgggft 

The widespread occurrence of the methylglyoxal pathways in bacteria suggested 
that it may function as a bypass of the glycolytic pathway by converting ctehydroxyacetone 
phosphate (an intermediate of the upper part of the glycolytic pathway) into pyruvate. 
However. In E,colJ, it has been shown that the methylglyoxal pathway can not replace 

20 nonfunctional glycolytic pathway reaction such as glyceraldehyde-3-phosphate 

dehydrogenase, triosephosphate isomerase, phosphoglycerate kinase or enotase. (Cooper 
RA.; (1984) Ann. Rev. Microbiol. 38.-49-68). These results seem to indicate that the 
methylglyoxa) pathway Is not Involved In glucose metabolism for growth. Considering that 
methylglyoxal is produced in all cells by a variety of mechanisms, and that methylglyoxa! Is 

25 a highly reactive compound with a strong toxic effect, this pathway has been seen as a 
detoxification mechanism (Ferguson et al„ Arch. Microbiol. 170: 209-219 (1998) and 
Ferguson G.P. Trends Microbiol 7: 242-247 (1999)). However, it may be advantageous to 
inactivate the mgsA gene (NCBI AE000198) which encodes a methylglyoxal synthase 
enzyme (E.C. 2.1,1.21), that converts dehydroxyacetone phosphate Into methylglyoxal, 

30 and inactivation of the gene may eliminating unnecessary formation of methylglyoxal by this 
enzyme, (see Flg.2) 

As described above in this disclosure, homologous genes of edd, ackA. pta and 
mgsA in other species or strains can be obtained by generating cDN A from RN A from such 
species using any technique known in the art, such as using RIboclone cDNA Synthesis 

39 Systems AMV RT (Promega. Madison, Wis.); hybridization studies, and' computer 
alignment programs. 
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Fg Call Cultures and fe rmentations - 

Methods suitable for the maintenance and growth of bacterial cells are well 
* known and reference is made to the MANUAL of Methods of General 

9 Bacteriology. Eds. P. Gerhardt et aL, American Society for Microbiology, 
Washington DC (1961) and T O. Brock in BIOTECHNOLOGY: A TEXTBOOK OF 
INDUSTRIAL MICROBIOLOGY, 2nd ed. (1939) Sinauer Associates, Sunderland, MA. 

Cell Precuttures 

10 Typically cell cultures are grown at 25 to 32 °C and preferably about 28 or 29 *C in 
appropriate media. White the examples describe growth media used, other exemplary 
growth media useful in the present invention are common commercially prepared media 
such as Luria Bertani (LB) broth, Sabouraud Dextrose (SO) broth or Yeast medium (YM) 
broth. Other defined or synthetic growth media may also be used and the appropriate 

1$ medium for growth of the particular microorganism wiM be known by someone skilled in the 
art of microbiology or fermentation science. Suitable pH ranges preferred for the 
fermentation are between pH 5 to pH 8 where pH 7 to pH 7.5 for the seed flasks and 
between pH 5 to pH 6 for the reactor vessel. 

It will be appreciated by one of skill In the art of fermentation microbiology that, now 

*> that Applicants have demonstrated the feasibility of the process of the present invention a 
number of factors affecting the fermentation processes may have to be optimized and 
- - controlled in order to maximize the-ascorbic add intermedfate production. Many of these 
factors such as pH, carbon source concentration, and dissolved oxygen levels may affect 
the enzymatic process depending on the cell types used for ascorbic acW intermediate 

25 production. 

..Femientatfan Media: -~ 

^ ^Fermentation media in the present invention must contain suitable carbon 

substrates which wfll include but are not limited to monosaccharides such as glucose and 
•fructose, oligosaccharides such as lactose or sucrose, polysaccharides such as starch or 

so cellulose and unpurified mixtures from a renewable feedstocks such as cheese whey 

permeate, comsteep liquor, sugar beet molasses, and barley malt. Additionally the carbon 
substrate may also be one-carbon substrates such as carbon. Although it is contemplated 
that all of the above mentioned carbon substrates are suitable In the present invention 
preferred are the carbohydrates glucose, fructose or sucrose. In one embodiment, the 

95 concentration of the carbon substrate is from about 55% to about 75% on a weight/weight 
basis and also from about 60 to about 70% on a weight/weight basis. In addition to an 
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appropriate carbon source, fermentation media must contain suitable minerals, salts, 
vitamins, cefaclors and buffers suitable for the growth or the cultures and promotion of the 
enzymatic pathway necessary for ascorbic acid intermediate production. 

* Batch and Continuous Fermentations: 

The present process employs a fed-batch method of fermentation for its culture 
systems. A classical batch fermentation is a closed system where the composition of the 
media Is set at the beginning of the fermentation and not subject to artificial alterations 
during the fermentation. Thus, at the beginning of the fermentation the media is inoculated 
10 with the desired organism or organisms and fermentation is permitted to occur adding 
nothing to the system. Typically, however, a "batch" fermentation Is batch with respect to 
the addition of carbon source and attempts are often made at controlling factors such as pH 
and oxygen concentration. In batch systems the metabolite and blomass compositions of 
the system change constantly up to the time the fermentation Is stopped. Within batch 
is cultures cells moderate through a static lag phase to a high growth log phase and finally to ■ 
a stationary phase where growth rate is diminished or halted- If untreated, cells in the 
• stationary phase will eventually die. Cells in log phase generally are responsible for the 
bulk of production of end product or intermediate. 

A variation on the standard batch system is the Fed-Batch system- Fed-Batch 
20 fermentation processes are also suitable in the present Invention and comprise a typical 
batch system with the exception that the substrate is added in increments as the 
-fermentation progresses.- Fed-Batch systems are -useful when catabolrte repression is apt 
to inhibit the metabolism of the cells and where it Is desirable to have limited amounts of 
substrate in the media. Measurement of the actual substrate concentration in Fed-Batch 
2s systems is difficult and is therefore estimated on the basis of the changes of measurable 
. --.factors such as pH, dissolved oxygen and the partial pressure of waste gases such as CO* 
•Batch-and Fed-Batch fermentations are common and well known inthe art and examples 
may be found in Brock, suprs^ 

Although the present Invention Is performed in .batch mode it is contemplated that 
ic the method would be adaptable to continuous fermentation methods. Continuous 

fermentation is an open system where a defined fermentation media is added continuously 
to a bioreactor and an equal amount of conditioned media Is removed simultaneously for 
processing. Continuous fermentation generally maintains the cultures at a constant high 
density where cells are primarily in log phase growth. 

« r 
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Continuous f ermentations: 

Continuous fermentation allows for the modulation of one factor or any number of 
factors that affect cell growth or end product concentration. For example, one method will 
maintain a limiting nutrient such as the carbon source or nitrogen level at a fixed rate and 

s .allow all other parameters to moderate- In other systems a number of factors affecting 
growth can be altered continuously while the cell concentration, measured by media 
turbidity, is kept constant. Continuous systems strive to maintain steady state growth 
conditions and thus the cell loss due to media being drawn off must be balanced against 
the cell growth rate in the fermentation. Methods of modulating nutrients and growth factors 

10 for continuous fermentation proce$$es as weli as techniques for maximizing the rate of 
product formation are well known in the art of industrial microbiology and a variety of 
methods are detailed by Brock, supra. 

It Is contemplated that the present invention may be practiced using either batch, 
fed-batch or continuous processes and that any known mode of fermentation would be 

« suitable. Additionally, it is contemplated that cells may be immobilized on a substrate as 
whole ceil catalysts and subjected to fermentation conditions for ascorbic acid intermediate 
production. 



G. Identification and Purification of desired products. 

2e The presence of desired products and produced end-products can be verified by 

running a HPLC analysis. Samples can be drawn at predetermined time periods and 
-analyzed forthe presence or quantftyof a specific-product or intermediate. For example, 
samples can be periodically drawn off the fermentation reactor vessel and loaded onto 
Dionex (Sunnyvale, CA, Product No. 043118) Ion Pac AS 10 column (4 mm times 250mm) 

25 connected to a Waters 2690 Separation Module and a Waters 41 0 Differential 
Refractometer(Milford, MA). 

A-method of-determlning the prc<luction-erBcjency of the modified host cell is an 
analysis of the substrate consumption or of QO z production. As used herein, "Oxygen 
Uptake Rate on "OUR" refers to the determination of the specific consumption of oxygen 

ao within the reactor vessel. Oxygen consumption can be determined using various on-line 
measurements. In one example, the OUR (mmol/(liter*hour)) is determined by the following 
formula: ((Airflow (standing liters per minute) / Fermentation weight (weight of the 
fermentation broth in kilograms)) X supply 0 2 X broth density X (a constant to correct for 
airflow calibration at 21 .1 C instead of standard 20.0 C)) minus ([airflow /fermentation 

35 weight} x [offgas O^offgas X supply N 2 X broth density X constant ). 
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Carbon evolution rate (CER) refers to the determination of bow much CO z is 
produced within the reactor vessel during fermentation. Usually, since no CO a Is initially or 
subsequently provided to the reaction vessel, any CO* is assumed to be produced by the 
fermentation process occurring within the reaction vessel. "Off-gas CO*" refers to the 

5 amount of CO2 measured within the reactor vessel, usually by mass spectroscopic methods 
known in the art. As used herein, •yield" refers to the amount of product/the amount of 
substrate. The yield can be expressed as a weight % { product gm/substrate gm) or as 
moles of product/moles of substrate. For example, the amount of the substrate, e.g., 
glucose, can be determined by the feed rate and the concentration of the added glucose. 

10 The amount of products present can be determined by various spectrophotometer or 

.analytic methodologies. One such methodology te. high performance liquid chromatography 
(HPLC). An increased yield refers to an increased yield es compared to the yield of a 
conversion using the wild-type organism, for example an increase of at least 5%, 10%, 
20%. 30%, 40%, 50% 75%. 90% over the yield of the wild-type. In addition, those skilled In 

15 the art will recognize that assays can be used to determine the amounts and/or presence of 
glucose and desired products, such as glycerol, 1 ^-propanediol. 

Methods for the purification of the desired end products or Intermediates from 
fermentation media are Known in the art Desired products Include succinate, glycerol, 1 ,3- 
propanediol, cborismate, pyruvate, ethane!, acetate and ascorbic add (ASA) intermediates, 

20 such as gluconate, 2-keto-D-gluconic ecid (2KD(3) T 2. 5DKG and 2-keto-4--gulonic acid 
(2KLG). 

- A protein is substantially pure-when at least about 60% to 75% of a sample exhibits 

a single polypeptide sequence. A substantially pure protein typically comprises about 60 to 
90% by weight of a protein sample, more usually about 95% and preferably at least 96%. 
25 97%, 9a% end 99%. 

The manner and method of carrying out the present invention may be more fully 
-understood by those of-skfU In -the art- by reference tothe following examples, which 
examples are not intended in any manner to limit (he scope of the present invention or of 
the claims directed thereto. All references and patent publications referred to herein are 
30 hereby incorporated by reference. 

The invention is better understood by reference to the following examples, which 
merely illustrate but not limit the best mode now known for practicing the invention. 
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Generat Methods 

Methods used for the construction of chromosomal modifications tn £ coii strains 
are described below. 

s Isolation of £ coii chromosomal DNA was as performed using the commercial kit 

UHraClean (MoBlo Lab. Inc. Solana Beach. CA). To transform & coii strains with PCR 
products, cells were grown and made competent by the procedure described by Datsenko 
& Wanner ((2000) Proa Nati. Acad. Sci 97: 6640-6645). Transformants carrying the 
plasmld were grown in 6-rnl SOB cultures (Hanahan, D (1983) J. Mot. Biol. 166, 667-560) 

io wfth ampldllin and L-arabinose at 30 C to an OD600 of » 0.6 and them made electro 

competent by concentrating 100-fold and washing three times with ice-cold 10% glycerol. 
Plasmld DNA transformation in E.coli strains was done by electroporatlon of competent 
cells prepared as described by Sambrook, supra. 

Different products of cellular metabolism or media components like glycerol, 

15 glucose and 1 ,3-propanedlol (3PG), compounds r were quantified by HPLC as described by 
Emptage et al M In PCT publication WO 01/12833-A1. The conversion of glycerol to 1.3- 
propanedk)! was monitored by HPLC. Analyses were performed using standard techniques 
and materials available to one of skill In the art of chromatography. One suitable method 
utilized a Waters Maxima 820 HPLC system using UV (210nm) and Rl detection. Samples 

20 were injected onto a Shodex SH-101 1 column (6 mm x 300 mm, purchased from Waters, 
Milfbrd, MA) equipped with a Shodex SH-101 1P precolumn <6 mm x SO mm), temperature 
- - "controlled at 50 C C/ using 0.01 NH2SO4 as mobile phase "at a flow rate of 0.5 rnL/min. 
When quantitative analysis was desired, samples were prepared with a known amount of 
trimethyiacetic acid as an external standard. 

29 

Gene disruptions 

To Integrate DNA Into a spedflc region of the chromosome, homology of the 
Inserting DNA to the targeted chromosomal site and a selectable marker are required. 
• Preferably the marker will be removed after integration. The loxP/Cre recomblnase system 
ao from P1 phage and the FRT/Flp recombinase system from yeast provide a mechanism to 
remove the marker. The DNA mutagenic cassettes containing homologous nucleic acid 
sequences of the regulatory region encoding the expression of the global regulators to be 
inactivated and a selectable marker flanked by /oxP (Palmeros et aL (2000) Gene 247:255- 
264) or FRT (Datsenko and Wanner (2000) Proc. Waff. Acad. Sci. USA 97:6640-6645) sites 
« • is transformed by electroporatlon into a target host cell harboring pKD46 {Datsenko and 
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Wanner, supra). Transformants are selected by growth of the cells in the presence of the 
antibiotic. Subsequently. pKD46 Is cured from the cells and recombinase plasmtds ere 
introduced into the transformants for removal of the selectable marker. Strain integrated 
with a loxP cassette are transformed with pJW168 that encodes Cre recombinase 

9 (Pelmeros et ah, supra). Strain containing a FRT cassette are transformed with pCP20that 
encodes Flp recombinase (Datsenko and Wanner, supra). After removal of the Integrated 
selectable marker, the recombinase plasmlds are cured from the strain. 

P1 vir transduction was preformed as described in Miller. J.H. A Short Course in 
Bacterial Genetics. A Laboratory Manual and Handbook for Escherichia cotf and related 

10 Bacteria (1992) Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. 

Primers gn4 PCR verification 

Nucleic acid primers based on arcA, edd. ackA, pta, mgsA, pckA and ppc 
sequences can be prepared by standard techniques. Primers can be used, for example, for 

1* amplification of nucleic acid sequences, e~g., by the polymerase chain- reaction (PCR). See, 
e.g.. PCR Protocols: A Guide to Methods and Applications, Innis et aU eds., Academic 
Press: San Diego (1990). The preparation and use of probes and primers is described. 
e.g„ in Sambrook et ah (1989) or Ausubel et ah (1987). 

Three PCR reactions were used to confirm that mutants had the correct structure. A 

20 freshly isolated colony was suspended in 20 microliters of water with and 5 nl portions were 
used in separate 20 \d PCR reactions following a 2-minute preincubation, "hot start", at 
95*C. Common test primenTinclLJded those set forth in figure 3. "A reaction was carried out 
with flanking locus-specific primers to verify simultaneous loss of the parental (wildtype) 
fragment and garn of the new specific exogenous fragment The later was repeated after 

25 elimination of the selective marker, the antibiotic resistance gene. Control colonies were 
always tested skte by side. 

Construction of the oSYpQ constructs. 

The utility of the PTS~/Glu* strains to convert carbon from g lucose to a product was 

30 tested by plasmids carrying genes encoding enzymes that carry out conversion of DHAP to 
1, 3 propanediol. The pSYCO coristructs were pSC10l (Stratagene) based plasmlds that 
carry genes for conversion of DHAP (dlhydroxyacetone-P) to glycerol (da/i and gpp2) from 
Saccharomyces cerevisiae (referred to as the glycerol pathway) and subsequently glycerol 
to 1, 3 -propanediol (d/ieB1-3, dnaX, orfW, X, and Y from Klebsiella, (referred to as the 1,3- 

35 propanediol pathway). The pSYCO constructs used In the current examples were 
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pSYCO101, 103, 106 and 109 and reference is made to figures 6 and 7 which depict the 
nucleotide sequence and plasmid map of pSYCO 101 , respectively. The pSYCO103 
construct is identical to pSYCOIOI except the DNA region which includes the glycerol 
pathway genes and the two EeoR1 sites m the opposite orientation to that of pSYCOl 01. 
s The pSYCOl 06 construct Is identical to pSYCOl 03 except for the removal of the 1 26 bp of 
non-coding plasmid DNA between the £coR1 sites and bp 10569 - 11 14$. The pSYCO109 
construct Is Identical to pSYCO 106 except that the coding region ortW has been deleted. 
For the experiments described herein, the plasmids are functionally equivalent 

fk. CtelRtion of the arcA oene. 

The arcA gene was deleted in 2 E.colf strains: MG1655 (ATCC 47076) and strain 
KLNDH which is a derivative of Strain KLP23 described by Emptage el al <F>CT publication 
WO 01/1 2833-A1) where the ndh gene has been inactivated. To inactivate the arcA gene, 

« the procedure described by Datsenko & Wanner was utilized (Datsenko A Wanner (zooo; 
PfOC r NatL Acad. $cL USA, 10: 6640-6645). Briefly* the method uses a plasmid that codes . 
for 3 protein activities that increase the frequency of homologous recombination. These 
activities allow the use of small regions of homology - 36 to 50 nucleotides long, to 
promote the homologous recombination between the chromosome and a DNA mutagenic 

20 cassette. A PCR reaction is used to generate the mutagenic cassette that contains the 
regions of homology with the chromosome at both ends. In this particular example, the 
arcA mutagenic cassette was obtained by PCRusing piasmid pKD3 (see below) and the 
primers ArcA1 and ArcA2 (SEQ ID NO, 1 and SEQ ID NO. 2) that contain regions of 
homology to the arcA gene and to the FRT-cat cassette present tn pKD3. This plasmid has 

» been described by Datsenko & Wanner, supra. 

The proper integration of the arcA mutagenic cassette was confirmed by 
sequencing the chromosomal region using primers ArcA3and ArcA4 (SEQ ID NO. 3 and 
SEQ ID NO.4). 

30 B. Effect of the arcA deletion. 

To evaluate the Impact of the deletion of the arcA gene in strain MG1 $56, the 
growth of the strain was evaJuated in shake flasks. For such a purpose, 250 ml flasks 
containing 10 ml of minimal media TM2 [TM2 medium (g/L): K^HPO* (13.6 g/l ), KH 2 P0 4 
(13.6 g/i), MgS04-7H20 (2 g/l), citric acid monohydrate (2 g/l), ferric ammonium citrate (0.3 

u g/l), (NH^O* (3.2 g/l), yeast extract (5 g/l). solution of trace elements (1 ml), pH adjusted 
to 6,8. The solution of trace elements contained (g/L): citric acid. H2O (4.0 g/l), MnS04« 
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H z O (3.0) g/l, Naa (1,0 g/l). FeS0 4 -7H 2 0 (0.10 g/l), CoCl2*6H20 (0-10 g/l), ZnSO^T^O 
(0,10 g/l), CUSO4-5H2O (0.010 g/l), H3BO3 (0.010 g/l) r and Na 2 Mo04'2H 2 0 (0.010 g/l)] 
supplemented with 2% glucose, were Inoculated to a 1/200 dilution with an overnight 
culture of the strain MG1 655 (wlfcftype-wt) and MG1 655 AarcA grown in LB (5 g/L yeast 
s extract 1 0 g/L tryptone, and 1 0 g/L Naa). Cultures were incubated at 37°0 in a orbital 
shaker PsycroTherm (New Brunswick, Edison, NJ, USA) at 275 rpm. Samples were taken 
at different time Interval? end the Optical density (OD) at 600 nm was measured. Cells were 
removed by centrifugation end the supernatants were utilized to quantify glucose and 
acetate. Cultures were grown under medium, atmospheric pressure and temperature, such 
10 that dissolved oxygen was not limiting. For example at 34*C and atmospheric pressure of 
760mmHg, in H a O the dissolved 0* is 7mg/ml. Conditions in medium will be lower (Vesilind, 
PwA, (1996) Introduction to Environmental Engineering, PWS Publishing Company 
Boston, MA Table 6.2 - 6). 

AS shown Ui Fig, 4a and 4b compared to the MG1S55 strain, strain MG1655 with the 
is arcA deletion ( AarcA) grew to a higher cell density (ODeoo of 35 versus 25) and produced 
less acetate (approximately 1 g/l compared to 9 g/l). These results Indicate that the AarcA 
strain was more efficient In converting media components into biomass, with less acetate 
as a metabolism by-product 

To evaluate the Impact of the arcA deletion (AarcA) m the production of a chemical 
20 compound, strains KLNDH (FM5 glpK-gldAnrtdn) and KLNDH AarcA were transformed with 
plasmid pSYCO101 (figures 6 and 7) which carries the genes to convert drhydroacetone- 
phosphate into glycerol- Strain FM5 is disclosed In US 5,494,816 and US 6,136,576 and 
reference is also made to ATCC Accession no. 5391 1. The resulting strains, KLNDH AarcA 
101 and KLNDH 101 were evaluated as described above in example 1b, except that the 
2s production of glycerol was also measured. As shown in table 1, compared to KLNDH, 
KLNDH AarcA strain produced more glycerol (28.7 compared to 22.5 g/l) and was more 
efficient for the conversion of glucose Into product (Molar yield = moles of product obtained 
per mole of glucose consumed) than the wildtype strain (1 .32 compared to 1 .02). 



TABLE 1 



STRAIN 


Glycerol fl/L 


Moiar Yield 


KLNDN 101 


22.5 


1.02 


KLNDH AarcA 101 


28.7 


1.32 
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EXAMPLE 2 

a. Inactivation of the rvoS gene. 

The rpoS gene was Inactivated in strain KLP23 (Empage et a., WO 01/12833) by 

0 transduction. A PHysate from strain JM101-#poS::Tnf 0 was prepared by standard 
methods. This strain contains the rpoS::Tn?0 loci from strain AMS150 described by 
McCann et al, (1991) J. Bacterid 173: 4188-4194. The P1-lysate was used to transduce 
theTnIO marker into KLP23. Transductants were selected on Lb plates containing 20 
mlcrograms/ml of Tetracycline. Colonies that appeared after 16-20 hrs. at 37 °C were 

1« further purified by streaking them onto fresh Tetracycline plates. Single colonies where 
checked for a decrease on hydrogervperoxfdase activity using the* bubbling test (Chou et 
al„ (1 996) BiotechnoL Bioeng. 50; 636-642). 

B. Effect of th e rooS::TN10 mutation. 

« To evaluate the effect of rpoS Inactivation a KLP23 rpoS:: Tn1 0 strain was 

transformed with pfesmrd pAH48 that codes for 2 yeast genes that convert 
dehydroxyacetone phosphate into glycerol (Ernptage et aJ PCT publication WO 01 12833- 
A). The resulting strain KLP23 rpoS/AS was analysed as described in example 1B. As can 
be seen on Figure 5, the inactivation of tpoS had a positive effect on glycerol production by 

20. Increasing the conversion of glucose Into glycerol (yield). 

EXAMPLE 3 

flu Oyerexpresslon of the^POCJaene 

To alter the regulation of the ppegene, Hs native regulatory region was replaced by 
2* ' a synthetic constructlon that includes the strong promoter 1 .6-Gl (SEQ ID NO. 1 9). The ppc 
natural promoter was replaced by the short 1.6 Gl promoter (SEQ ID NO. 20) in 
FMP'1.5gapAmgsA (genotype FM5 glpk-gldA^h-ptsHlcrr-galPp-trc glkptrc arcA-edd- 
gapAp-1 ,5mgsA-). 

Construction of the 1 .6gapA in FMP 1 (s as follows: The native gapA promoter was 
ao replaced with a synthetic short 1 .5 Gl promoter (SEQ ID NO. 22) by replacing 225 bp of 
upstream gepA sequence (see Gene Bank Accession No. U00096)) with FRT-CmR-FRT 
and engineered promoter. The replacement cassette was amplified by PCR with the primer 
pair SEQ ID NO. 23 and SEQ ID NO- 24 using pKD3 as a template. The primer SEQ ID 
NO. 23 contains 39 bp of homology to gapA including the ATG start, the short 1 >5G| 
» promoter and 20 bp of homology to template pKD3. Primer SEQ ID NO. 24 contains 59 bp 
of homology to upstream gapA sequence and 21 bp of homology to pKD3. The PCR 
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products were gel<-purffied and electroporated into MG1655/pKD46 competent cells to give 
MG1655 1.5ga&A::Cm. Recombinant cells were selected on LB plates with 12,5 mg/L 
chloramphenicol. Successful Integration of the cassette replaces the region 34 - 258 bp 
upstream of the gapAATG start codon with a FRT-Cm R-FRT-abort 1,5 Gl promoter 

5 cassette. A P1 phage Systae was prepared and used to move to mutation to FMF*::Km. This 
strain was designated FMP*::Km 1.5gapA::Cm. 

The short 1.5 Gl gepA promoter in MG1655 1.5 gapA::Cm was replaced with a 1.6 
Gl promoter (SEQ ID NO. 20). To create the 1-6gapA strain, a replacement cassette was 
PCR amplified with primer pafr SEQ ID NO. 25 and SEQ ID NO. 26 using genomic DNA 

10 from MG1655 1.5gapA::Cm as template. The primer SEQ ID NO. 26 contains homology to 
the ga/>A upstream region in MG1655 1 .5 gapA::Cm and contains the 1.6 G I promoter 
(SEQ ID NO. 20). The 1 .6 Gl promoter replacement cassette was used to replace the 
native gapA promoter of MG1655 as described above to give strain MG1655 1.6gapA::Cm. 
This strain was then used to replace the natural gapA promoter of FMP' ::Km (using P1 

is transduction) to give~strain RV^:H^^ 

dehydrogenase activity was determined using cell-free extracts prepared from 
FMP'::Km1.5gapA::Cm, FMP*::Km 1.6gapA::Cm and FMP'::Km as control. The values 
obtained compared to that of the control. The strain FMP*::Km 1 .6gapA::Cm was 
transformed with the plasmid pSYCO106. 

2« Two sets of primers were used. The ppcR (SEQ ID NO. 17) primer is 1 00 

nucleotides long and includes the entire sequence from the +1 of P1 (natural ppc promoter) 
and transcription start to 41 bp upstream the ATG of ppc; the 1.6 Gl promoter sequence 
from 4 bp upstream the -35 to 9 bp downstream the —1 0 and the priming site for pKD3. The 
ppcF primer (SEQ ID NO. 18) is 80 bp long and contains 55 bp of sequence (1860478 to 

25 1 860536) from the intergenic region and the priming site for pKD3. 

The PCR primer utilized to amplify the excisable antibiotic marker, carries besides 
the regions of homology with the chromosome and the antibiotic cassette, the 1.6 Gl 
promoter. 

Once the proper genomic modification was corroborated by sequencing, the level , 
so of Phosphoenolpyruvate carboxylase (Ppc) enzymatic activity was measured from cell free 

extracts obtained from the shake flasks described in example 1. Aliquots of celts were . 

harvested in mid-log phase, broken by two passages through a French press cell. 

centr'rfuged for 15 mln at 14,000 rpm, and ultracentrifuged 1 hrat 50,000 rpm. The 

supernatant was removed and used as a source of proteins. Specific activities of Ppc are 
aa reported in Table 2. The replacement of the natural ppc promoter with the short 1 .6 Gl 

promoter (SEQ ID NO. 20) increased tha Ppc enzyme activity by three fold. 
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TABLE 2 



STRAIN 


OD 


PEP carboxylase 

specific activity U/mo orotein) 


FMP'1-5gap AmgaA, 
DSYCO106 


11.3 


0.28 


FMPI ,5gap Amgs 1.6ppc, 
pSYCO106 


9.3 


0.86 



5 B. Effect of Overexpres slng the 00c oene. 

Shake flask cultures were used to assess the conversion of glucose to 
1,3-propanedio) In E. co// strains FMP'1.5gap Amgs/pSYCO106 end FMP'1.5gap Amgs 
1,6ppc/pSYCO106. 

The strains, grown In LB medium containing 50 mg/L spectinomycin for 10 hrs, were 
10 used to inoculated (200 pi) Into 250 mi-baffled Erienmeyer f tasks containing 10 ml TM2 
medium, 20 g/L glucose. 50 mg/L spectinomycin, and 2 mg/L vitamin B1 2. 

The flasks were Incubated at 300 rpm and 34 0 C. Representative results are given 
In Table 3. Both an increase in the molar yield and a decrease of acetate production were 
observed with the addition of the 1.6 ppc mutation to the parent strain. 

15 

TABLE 3 



Strain 


OD350 


Glucose 
consumed 


Glycerol 

produced 

<S/L) 


1,3- 

propanediol 

produced 

(9/L) 


Acetate 

produced 

(g/L) 


Molar 
Yield 


FMP'1.5gap 
ArnasA/DSYCO106 


5.1 


17.1 


8.24 


2.19 


1.78 


1.25 


FMP'1 .5gap AmgsA 
1.6ddc/SYCO106 


12.6 


17.1 


7J& 


334 


D.34 


1.32 



20 EXAMPLE 4 

£u Pgletlpn Pfthe edd gene. 

The same procedure described for the deletion of a/nA was used to inactivate the 
edd gene, except that the Cat cassette was obtained from piasmid pLoxCat2 described by 
« palmeroe et al.^Palmeros et at. (2000) Gene 247:255-264). The primers ecfcfl and edd2 
(SEQ ID. NO. 5 and SEQ ID NO. 6) utilized to generate the mutagenic cassette which 
contained 80 bases of homology to the edd gene and 20 bases to pLoxCat2. 
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The edd gene was deleted in strain KLGG AarcA {FM5 gipK-gldA-ndh-ptsHlcrr-KmR 
galPp-trc glkp4rc arcA-). Proper integration of the cassette was confirmed by sequencing 
the chromosomal region using primers edo3 and ecW4 SEQ ID NO. 7 and SEQ ID NO- 8. 

s B- Effect of the delation of edd. 

To evaluate the Impact of the edd deletion In the production of a chemical 
compound, strains KLGG AaroA t and KLGG AarcA Aedd were transformed with plasmid 
pSYCO103 that carries the genes to convert dihydroacetone-phosphate Into glycerol, and 
glycerol Into 1,3, propanediol in the presence of vitamin B-12. The resulting strains KLGG 

10 03, and KLGG AarcA Aedd /1 01 were evaluated as described in Example 2» except 

that the production of glycerol and 1,3 propanediol was measured. As shown in table 4, 
compared With the AarcA strain, the Aedd strain produced more glycerol and 1 . 3 
propanediol. Furthermore, it was mere efficient for the conversion of glucose into products 
(Molar yield »moles of product obtained per mole of glucose). 

TABLE 4 



Strain 




Glucose 
consumed 


Glycerol 
<g/L) 


1.3- 
propanediol 
(9/L> 


Molar 
yield 


Titer 
Glyoerol + 

1,3- 
propanedioJ 


LGGAarcA/103 


6.2 


4.51 


1.77 


1.14 


1.44 


2.91 


KLGGAarcA/Aedd 
/103 


5.9 


5.82 


2.36 


2,14 


1.7 


4.5 



EXAMPLE 5 

A. Deletion of the mosA 

Deletion of the mgsA gene was performed as described above for arcA. The 
primers mgsA-1 (SEQ ID NO. 13) and mgsA-2 (SEQ ID NO. 14) were utilized to generate 
the mutagenic DNA cassette, which contained 36 bases of homology to the mgsA gene, 
and 20 bases of homology to pKD4. Plasmid pKD4 has been described by Datsenko & 
Wanner, supra. The mgsA gene was deleted in strain FMP'1.5gapA ( FM5 glpk-gldA-hdh- 
ptsHlcrr-galPp-trc glkp-trc arcA-edd-gapAp-1 .5). Proper integration of the DNA cassette 
was confirmed by sequencing the chromosomal region using primers mgsA-3 (SEQ ID NO. 
15) and mgsA-4 (SEQ ID NO. 16). 
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R. Pffcc^ gf tfte masA deletion. 

To evaluate the impact of the mgsA deletion In the production of a chemical 
compound, strains FMP'1 .5gap and FMPM .Sgap AmgsA were transformed wfth plasrnid 
pSYCO106 that carries the genes to convert dlhydroacetone-phosphate into glycerol, and 

s glycerol into 1,3, propanediol In the presence of vitamin B-12. The resulting strains were 
evaluated as described in example 1 B, except that the production of glycerol and 1 ,3 
propanediol was measured. As shown in table 5, compared with the parent strain the 
AmgsA strain produced more glycerol and 1, 3 propanediol. The cell efficiency 
Improvement due to the mgsA mutation was more visible at the Molar yield obtained (see 

■jo Table 5). 

TABLE 5 



Strain 


glycerol +1,3 
propanediol 


glucose consumed 
fe/L) 


Motor Yield 
(nrioVmol) 


FMP*1 *5gapA/pSYCOl06 


10£ 


17.6 


1.21 


FMP*1 .SgapAmgsA/pSYCOl 06 


10.7 


17.1 


1.26 



Molar Yield = motes of product formed per motes substrate used. 



1fi EXAMPLE 6 

A. Delation of the ackA-nta genes. 

The eokA and pfa genes are contiguous in the E.co// chromosome and deletion of 

these genes was performed as described above for arcA, The primers DacA-F (SEQ ID 

NO. 9) and Dpta-R (SEQ ID NO. 10) were utilized to generate" a mutagenic DNA cassette 
20 which contained 80 bases of homology to the ackA or pra genes respectively, and 20 

bases of homology to pKD3. 

The ac/cA and pta genes were deleted in strain Triplel .6 btuR1 .6yqhD (FM5 glpk- 

gldArndh^tsHlcrr-galPp-trc glkp-trc arcA-edd-gapAp-1 .5 mgsA«ppcp-1.6 ydK-btuRp-1 .6 

yqhC-yqhDp-1.6). 

a The proper integration of the cassette was confirmed by sequencing the 

chromosomal region using primers acWJ (SEQ ID NO. 11) and ackD (SEQ ID NO, 12). The 
resulting strain Triplel .6 btuRl.6yqbDackA-pta and the parent were transformed with 
pSYCO109, ) 

The strains were grown as described above except that the stability of the plasrnid 

so was examined by growing the strains in the presence or absence of spectinomycin 

(concentration of 100 mlcrograms/ml) and passing the culture every 24 hours by adding 16 
microliters of the previous days culture to a fresh batch of medium and allowing the strain 
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to grow. This sequential dilution of the culture was repeated every day for 5 days and 
samples were taken on a daily basis to assess the production of glycerol and 1 , 3~ 
propanediol. Typically in the absence of spectinomycln in the culture, the pSYCO 109 
plasmid would be quickly lost upon passage of the culture. 

B- Effect of t he ack-uta deletion. 

Table 6 



Strain I 


Spec* 


MY-D1 


MY-D2 


MY-D3 


MY-D4 


MY-D5 


TriptettuR 
PSYC0109 


+ 


1£4 


1-28 


1.0$ 


0.84 


0.79 


TripleiifciR 

Aack-pta, 
pSYCO109 


+ 


1.15 


1.22 


1,24 




1.07 


TripteJbftrf* 

1.6>qf>D t 

i>SYCO109 




1.23 


1.04 


0.95 


0.61 


035 


TripJehfiiR 
PSYCO109 




1.23 


1.22 


1.23 


1.13 


1.11 



MY = Molar YteW (moies of product formed per moles substrate used); DX - Day 1.2,3,4 
or 5 and Spec (+/-) means with or with spec* nomycln. 



1» In the presence of spectinomydn, the parent strain exhibits a 1 5% decline In molar 

yield by day 3 and a 38% decline by day 5 indicating that even in the presence of antibiotic, 
the production of glycerol and 1 , 3-propanedIol Is negatively effected by continuous passage 
of the culture. With the deletion of ack-pta, the molar yield declines by only 0.07% after 5 
days, indicating a significant increase in stability of production. 

20 This effect is even more pronounced in the absence of antibiotic. By day 5, the parent 
strain showed a 80% decrease in molar yield while the strain with the deletion of ack-pta 
showed only a 0.1% decrease m molar yield* Thus the deletion of these two genes 
significantly increased the stability of the plasmid and production of glycerol and 1,3- 
propanedM. 
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pLAIMS 

1 . A method of enhancing the production of a desired product in a bacterial host 
cell comprising, a) modifying a bacterial host call by inactivating an endogenous e/eA gene 
and b) culturing the modified bacterial host cell in suitable culture media comprising 
glucose under aerobic conditions to allow production of a desired product 

2* The method according to claim 1 . wherein the bacterial host cell is from a strain 
of the Enterobacteriaceae family. 

3. The method according to claim 2, wherein the bacterial host cell Is an £. coll or 
Pantoea ceil. 

4. The method according to claim 1, wherein the bacterial host cell is a PTS"7Glu + 

cell 

5. The method accord tag to claim 1 . wherein the desired product Is selected from 
the group consisting of glycerol, PEP* pyruvate, chorismate, ethanol, succinate and 

d ihydroxyacetone-R 

6. The method according to claim 5, wherein the desired product is chorismate, 

7. The method according to claim 6, wherein the chorismate Is further converted 
to an aromatic amino add. 

3. The method according to claim 1 further comprising Inactivating the expression 
of an endogenous gene encoding a polypeptide having RpoS activity. Edd activity. Pta 
activity, AckA activity or MgsA activity. . 

9. The method according to daim 1 further comprising transforming the bacterial 
host cell with a DNA fragment comprising an exogenous promoter, wherein the DNA 
fragment including the exogenous promoter is integrated into the host cell chromosome 
and replaces the endogenous promoter which is operably linked to a PEP carboxylase 
coding sequence wherein PEP carboxylase Is overexpresaed. 
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10. The method according to claim 1 further comprising isolating the desired 
product from the culture media. 

1 1 . The modified bacterial host cell obtained according to the method of claim 1 . 

12. A method of enhancing biomass production in bacterial host -cells comprising 
a) modifying a bacterial host cell by Inactivating an endogenous arcA gene and b) cufturing 
the modified bacterial cell under suitable culture conditions wherein said culture conditions 
include aerobic fermentation and glucose as a carbon source and wherein biomass 
production is enhanced in the modified bacterial cell compared to biomass production in a 
corresponding non-modified bacterial cell cultured under essentially the same conditions. 

1 3. The method according to claim 12 further comprising Inactivating an 
endogenous rpoS gene, 

14. The method according to darm 12, wherein the endogenous areA gene Is 
Inactivated by a deletion. 

15. The method according to claim 12. further comprising Inactivating an 
endogenous gene encoding a polypeptide having phosphogluconate dehydratase activity, 
phosphotransacetylase activity, acetyl kinase activity or methylglyoxyal synthase activity. 

1 6. The method according to claim 15, wherein the endogenous gene encoding a 
polypeptide having phosphogluconate dehydratase activity is an add gene. 

17. The method according to claim 15, wherein the endogenous gene encoding a 
polypeptide having phosphotransacetylase activity is a pta gene. 

18. The method according to claim 15, wherein the endogenous gene encoding a 
polypeptide having acetyl kinase activity is an ackA gene. 

19. The method according to claim 15, wherein the endogenous gene encoding a 
polypeptide having methylglyoxyal synthase activity is a mgsh gene. 

20. The method according to claim 12 further comprising isolating the modified 
bacterial cell. 
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21. The method according to claim 12, wherein the bacterial host cell Is selected 
from the group consisting Escherichia cells* Pantoea cells, Klebsiella cells, Gluoonobacter 
cells and Erwinte cells. 

22. The method according to claim 21, wherein the bacterial host cell is an £ coil 
cell or a Pantoea cell 

23. The modified bacterial cells obtained according to the method of claim 12. 

24. A genetically engineered bacterial strain of the Enterobacteriacea family 
comprising an inactivated endogenous arvA gene and an overexpressed polypeptide 
having PEP carboxylase activity. 

25. The genetically engineered bacterial strain of claim 24, wherein said strain Is 
selected from the genus consisting of Escherichia, Pantoea, Klebsiella, Gluconobact&r and 
Erwlnla, 

26. The genetically engineered bacterial strain of claim 24, wherein said strain is a 
strain of E, coll 

27. The genetically engineered bacterial strain of daim 24, wherein the 
endogenous arch gene is deleted. 

28. The genetically engineered bacterial strain of claim 24 further comprising an 
inactivated endogenous mgsfK. 

2d. The genetically engineered bacterial strain of dairn 24 further comprising an 
inactivated endogenous ecfcf* 

30. The genetically engineered bacterial strain of claim 24, further comprising an 
inactivated endogenous rpoS. 

31 . The genetically engineered bacterial strain of claim 24, wherein the 
overexpressed polypeptide having PEP carboxylase Is operably linked to an exogenous 
promoter. 
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32. The genetically engineered bacterial strain of claim 31 , wherein the exogenous 
promoter te a Gl promoter. 

33. A genetically engineered bacterial strain comprising an inactivated 
endogenous rpoS gene, 

34. The genetically engineered tecten*ai strain of claim 33. wherein the 
endogenous rpoS gene is deleted. 

36* The genetically engineered bacterial strain of claim 33 further comprising an 
overexpressed polypeptide having PEP carboxylase activity. 

36. The genetically engineered bacterial strain of claim 33, wherein said bacterial 
strain has a PTS"7Glu* phanotype, which was derived from a bacterial strain originally 
capable of utilizing a PTS for carbohydrate transport 

37. The genetically engineered bacterial strain of claim 36, wherein said bacterial 
strain is & cdL 

38. A method of enhancing the production of an aromatic emino acid in an £ cofi 
host cell comprising, a) modifying an £ coU host cell by inactivating an endogenous arcA 
gene and b) culturing the modified host cell in suitable culture media comprising glucose 
under aerobic conditions to allow production of an aromatic amino acid. 

39. The method according to claim 38, wherein the bacterial host cell is a 
-PT*S~7Glu*cell. - 

40. The method according to claim 38 further comprising isolating the aromatic 
amino acid from the culture media. 

• 41. The modified EL coii host cell obtained according to the method of claim 38. 

42. The method according to claim 38, wherein the endogenous arcA gene is 
inactivated by a deletion. 
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FIGURE 1q 



<5 



GLUCOSE 




PYRUVATE 
Acetyl Co A 

TCA cycle 




Acetate 



Figure leu 

Schematic representation of some interconnected meluboUc routes involved in 
glucose assi mJlation. The numbers represents the reactions described more In detail 
in Table 1. Reactions 1-3 coinprise the most efficient pathway for glucose 
assimilation. Reaction 4 is the sum of reactions 1-3. AH the other reactions (5-B) are 
alternative routes that in general, are less efficient or skip the formation of some 
important metabolic Intermediates. 
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FIGURE lb 



Glucose 



Glycolysis 



-> 2 pyruvate * 0 ATP + 2 NADH 



Pyruvate 
dehydrogenase 



2 pyruvate + 2 ADP + 2 Co A ► 2 acety 1-CoA * 2 NADH * 2 CO z 



Bgttctjon 3: 



2 Acetyl-CoA - 



TCA 



"►4C0 2 + 2 ATP + 2FADH 2 + 2NADPH + 4 NADH 



Reaction 4 : 



Glucose ■ 



Reactions; 
1,2,3 



^ 6C q 2 + 2AT? + 2FADH2 + 2NADPH + 8NA5H 



Reaction 5 : 



Entner-DoudQtofJ 
Pathway and lower 
part of glycolysis 



Glucose 6-P 



■> 2 Pyruvate + 1NADH + 1 NADPH * 1 ATP 



j 
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Figure 2. 
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FIGURE 3 A 

(a) SEQ ID NO. 1 

^TTCTTATCGTTC^GACGAQTTGQTAACACGCAA 
TGCTTC 

<b) SBQIDNO,2 

ttccagatcacc^cagaackmataaccttca 

CCTTAG 

(c) SEQ ID NO. 3 

ArcA3 ^ 4 A ^ 

agttogtaacacgcaacacgcaac 

(d) SEQ ID NO. 4 
AxcA4 

CGCAGAAGCGATAACCTTCACCG 
(c) SEQ ID NO. 5 

ATOAATCCAC7SATTGTTACGCGTA 
CTCGCTCTG^ 

(0 SEQ ID NO. 6 
StSaaAAG^ 

CKXKnX>AATAATTCACGT<XTGTCGGATGCATATGGCGGCCGC 



(g) SEQ ID NO. 7 
Edd3 

TAACATGATCTTGCGCAGATTG 



(h) SEQ 3D NO. 8 
Edd4 

ACTGCACACTCGGTACGCAGA 
<5) SEQ BO NO- 9 

ATGTCGAGTAAGTTAGTACTGGTTCTGAACTGCGGT^ 
CATCATCGATGCAGTAAATGGTGATGTGTAGGCTGGAGCTGCrT 
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FIGURE 3B 



(j) SEQ ID NO. 10 
Dpta-R 

TFACTGCTGCTG1 ^ «. » — - — - -- - _ „ _ _ , _ 

tcJ^q^gtc 



(k) SEQ1D-NO. 11 
Ack-U 

ATTCATTGAGTCGTCAAATT 



0) SEQ JD NO. 12 
Ack-D 

ATTGGGGACATAGCGGAAAT 



(ni)SBGIDNO. 13 

OT^^ATTATGGAACTOA 
CTTCG 



<n) SEQ ID NO. 14 

OTCAGACGGTCCGCGAGATAAC^ 
CTTAG 

(0) SEQ ID NO. 15 
MgsA-3 

CTTGAATTGTTGGATGGCGATG 



<p) SEGIDNO. 16 
MgsA-4 

CGTCACGTTATTGGATGAGAG 



(q) SEQ ID NO. 17 
PpcR 

TCGCATTGGCGCC 

CTCAGGATGTGGCATTGTCAAGGGCATATGAATATCCTCCTTAG 



TC^CATTGGCGCGAATATGCTCX3GGC^ 



fr) SEQ ID NO. 18 
OGATITlTrAACAT^ 

CGTAAATTCCTGCTATTTATTCGTGTGTAGGCTGGAGCTGCTTC 



PAGE 60/72 * RCVD AT 7/8/2004 2:51:58 PM [Eastern Daylight Time] * SVR:USPTO^FXRF-1/5 * DNIS:872fl306 * CSID:202 457 0107 * DURATION (mm-S5):29-32 



JUL-08-2004 15:25 



AJINOMOTO CORP SERU LLC 



202 457 0107 P. 61/72 



WO 20047053421 



PCT/US2003/U31445 



6 of 17 
FIGURE 3C 



(s) SEQ ID NO. 19 
(t) SEQ ID NO. 20 

(u) SEQIDNO.22 

^feort 1 5 GI urometer 

GCCCTTGACTATGCCACATC^ 



(v) SEQ ID NO. 23 

CAGGATGTGGCATAGTCAAGGGCATATGAATATCCTCCrTAG 



(w) SEQ ID NO. 24 

TCCGTGTAGGCTGGAGCTGCTTCa 



(x) SEQ ID NO- 25 
GapA-R3 

GTCGACAAACGCTGGTATACCTCA 



(y) SEQ ID NO. 26 
AGT^ 

CAGGATGTGGCATTGTCAAGGGCAXATGAATATGCTCCTTAG 
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Figure 4a 
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Figure 5 
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FIG. 6A. 

TAGTAAAGCCCTCGCTAGATTTTAA^ 

tgcgataacaagaaaaagccagcctttcatgatatatctccx^aatttgtgtaggg 

cttattatgcacgcttaaaaataataaaagc^gacttgacctgatagtttggctg 

tgagcaattatgtgcttagtgcatctaacgcttg^ 

cggcgtcggcntgaacgaattgttagacattatttgccgactaccttggtgatc . 

tcgcctttcacgtagtggacaaattcrn'gcaactgatctgcgcgcgaggcgaag 

cgatcttcttcttgtccaagataagcctgtctagcttcaagtatgacgggctgat 

actgggccggcaggcgctccattgcccagtcggcagcgacatccttcggcgc 

gattttgccggttactgcgctgtaccaaatgcgggacaacgtaagcactacatt 

tcgctcatcgccagcccagtcgggcggcgagttccatagcgttaaggtttcat 

ttagcgcctcaaatagatcctgttcaggaaccggatcaaagagttcctccgccg 

ctggacctaccaaggcaacgctatgttctcttgcttttgtca 

gatcaatgtcgatcgtggctggctcgaagatacctgcaagaatgtcattgcgct 

gccattctccaaattgcagttcgcgcttagctggataacgcgacggaatgatgt 

cgtcgtgcacaacaatggtgacttctacagcgcggagaatctcgctctctccag 

gggaagccgaagtttccaaaaggtcgttgatcaaagctcgccgcgttgt^ 

caagccttacggtcaccgtaacc^ggaaatcaatatcactgrrgtggcttgaggc 

cgccatccactgcggagccgtacaaatgtacggcgagcaacgtcggttcgaga 

tggcgctcgatgacgccaactacctctgatagttgagtcgatacttcggcgatc 

accgcttccctcatgatgtttaactrtgttttagggcg actgccctg ctg cgta 

acatcgttggt gct ccataacatcaaacatcgacccacggcgtaacgcgcttgc 

tgcttggatgcccgaggcatagactgtaccccaaaaaaacagtcataacaagcc 

atgaaaaccgccactgcgccgttaccaccgctgcgttcggtcaaggttctgga 

ccagttgcgtgagcgcatacgctacttgcattacagcttacgaaccgaacaggc 

TTATGTCCACTGGGTTCGTGCCnTCATCCGTTTCCA 

AACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGC 

GCAAGGTTTCGGTCTCCACGCATCGTCAGGCATTGGCGGCCTTGCTGTTCTTCT 

ACGGCAAGGTOCTGTGCACX5GATCTGCCCTGGC7TCAGGAGATCGGAAGACCT 

CGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCA 

TCCTCGGTTTTCTGGAAGGCGAGCATCGTTTGTTCGCCCAGCTTCTGTATGGAA 

CGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATGTGGATTTC 

GATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCT 

TGATGTTACCCGAGAGCTTGGCACCCAGCCTGCGCGAGCAGGGGAATTAATTC 
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FIG. 6B 

CCACGGGTTTTGCTGCCCGCAAACGGG 

AGAATCGCAGATCCGGCTTCAGCCGGTTTGCCGGCTGAAAGCGCTATTTCTTCC 

AGAATTGCCATGATTTTTTCCCCACGGGAGGCGTCACTGGCTCCCGTGTTGTCG 

GC^GCTTrGATTCGATAAGCAGCATCGCCTGTTTCAGGCTGTCTATGTGTGACT 

GTTGAGGTGTAAGAAGTTGTCTCAGGTGTTCAATTT^ 

TTTACTGGTTTCACCTGTTCTAT^ 

CGATCTGTTCATGGTGAACAGCTTTGAATGCAOCAAAAACTCGTAAAAGCTCTG 
ATGTATCTATCTTrrTTA 

GATATGTAACGGTGAACAGTTGTTCTACTTTTGTTTGTTAGTC^ 
GATAGATACAAGAGCCATAAGAACCTGAGATCCTTCCGTATTTAGCGAGTATGTT 
CTCTAGTGTG GTT CGTTGTTTTTG CGTGAGCCATGAGAACG AACCATTGAG ATG 
ATACrTTACTlTGCATGTCACTCAAAAA 

TrGCAGTTAAAGCATCGTGTAGTGTTTTTCTTAGTCCGTTATG 

TGATGTAATG GTTGTTG GTATTTTGTCACCATTCATTTTT ATCTGGTTGTTCTC^ 

GTTCG (5TTACG AG ATCCATTTGTCTATCTAGTTCAACTTGGAAAATGAAC GTATC 

AGTCGGGCG G CCTCG CTTATCAACC ACCAATTTCATATTGCT GT AAGTGTTTAAA 

TCTTTACTTTATTGGTTTCAAAACCCATTC 

ATTTTCAAGCATTAACATGAACTTAAATTCATCAAGGCTAATCTCT 
TGTGAGTTTTCTTTTGTGT1 AGTTCTTTTAATAACCACTC^ 

T ATTTGTTTTC AAAA G ACTTAAC ATGTTCCAGATTATATTTT ATGAATTTTTTT AAC 
TGGAAAAGATAAGGCAATATCTCTTCACTAAAAACT AATTCT AATTTTTCGCTTGA 
QAACTTGG CATAGTTTGTCCACT GGAAAATCTCAAAGCCTTTAACCAAAGG ATTC 
CTGATTTC CACAGTTCTCGTC ATCAGCTCTCTGGTTG CTTTAGCTAATACAC CAT 
AAGCATTTTCCCTACTGATGTTCATCATCTGAGCGTATTGGTTATAAGTGAACGA 
TACCGTCCGTTCTTTCCTTGTAGGGTTTTCAATCGTGGGGTTGAGTAGTGCCAC 
ACAGCATAAAATTAGCTTGGTTTCATGCTCCGTTAAGTCATAGCGACTAATCGCT 
AGTTCATTTGCTTTGAAAACAACTAATTCAG ACATACATCTCAATTGGTCT AGGT 
GATTTTAATCACTATACCAATTGAGATGGGCTAGTCAATGATAATTACTAGTCC^ 
TTCCTTTGAGTTGTGGGTATCTGTAAATTCT 

TAAATTCTGCTAGACCCTCTGTAAATTCCGCTAGACCTTTGTGTGI I I ITTTTGTT 
TATATTCAAGTGGTTATAATTTATAGAATAAAGAAAGAATAAAAAAAGATAAAAAG 
AATAGATCCCAGCCCTGTGTATAACTCACTACTTTAGTCAGTTCCGCAGTATTAC 
AAAAGGATGTCGGAAACGCTGTTTGCTCCTCTACAAAACAGACCTTAAAACCCTA 
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FIG. 6C 

AAGGCTTAAGTAGCACCCTCGCAAGCTCGGGCAAATCGCTGAATATTCCTTTTG 

TCTCCGACCATCAGGCACCTGAGTCGCTGTCTTTTTCGTGACATTCAGTTCGCT 

GCGCTCACGGCTCTGGCAGTGAATGGGGGTAAATGGCACTACAGGCGCCTTTT 

ATGGATTCATGCAAGGAAACTACCCATAATACAAGAAAAGCCCGTCACGGGCTT 

CTCAGGGCGTTTTATGGCGGGTCTGCTATGTGGTGCTATCTGACTTTT^ 

TCAGCAGTTCCTGCCCTCTGATTTTCCAGTCTGACCACTTCGGATTATCCCGTG 

ACAGGTCATTCAGACTGGCTAATGGACCCAGTAAGGCAGCGGTATCATCAACAG 

GCTTACCCGTCTTACTGTCGGGAATTC^TTTAAATAGTCAAAAGCCTCCGACCG 

GAGGCTTTTGACTGCTAGGCGATCTGTGCTGTTTGCCACGGTATGCAGCACCA 

GCGCGAGATTATGGGCTCGCACGCTCGACTGTCGGACGGGGGCACTGGAACG 

AGAAGTCAGGCGAGCCGTCACGCCCTTGACAATGCCACATCCTGAGCAAATAAT 

TCAACC^CTAAACAAATCAACCGCGTTTCCCGGAGGTAACCAAGCTTGCGGGAG 

AGAATGATGAACAAGAGCCAACAAGTTCAGACAATCACCCTGGCCGCCGCCCA 

GCAAATGGCGGCGGCGGTGGAAAAAAAAGCCACTGAGATCAACGTGGCGGT G 

GTGTTTTCCGTAGTTGACCGC^GAGGCAACACGCTGCnrTATCCAGCGGATGGA 

CGAGGCCTTCGTCTCX^GCTGCGATATTTC 

GCAGCCTGAAGC^GGTACCCATGAAATTACGTCAGCGGTCCAGCCAGGACAA 
TCT CTGTACG GTCTG C AGCT AACCAACCAACAGCGAATTATTA I "1 1 1 TGGCGGC 
GGCCTGCCAGTTATTTTTAATGAGCAGGTAATTGGCGCCGTCGGCGTTAGCGG 
CGGTACGGTCGAGCAGC^TCAATTATTAGCCCAGTGCGCCCTGGATTGTTTTTC 
CGCATTATAACCTGAAGCGAGAAGGTATATTATGAGCTATCGTATGTTCCGCCA 
GGCATTCTGAGTGTTAACGAGGGGACCGTCATGTCGCTTTCACCGCCAGGCGT 
ACGCCTGTTTTAGGATCCGCGCG GGCACCATGCC GGCGCCATCAATG AGCTGT 
GCTGGGGGCTGGAGGAGCAGGGGGTCCCCTGCCAGACCATAACCTATGACGG 

aggcggtgacgccgctgcgctgggcgccctggcggccagaagctcgcccct 

gcgggtg ggtatcgggctcagcgcgtccggcg ag atagccctcactcatgc c 

cagctgccggcggacgcgccgctggctacoggacacgtcaccgatagcgacg 

atgaactgcgtacgctcggcgccaacgccggggagctggttaaagtgctgccg 

ttmg7gagagaaactgaatgtatcgtatctatacccgcaccggggataaaggc 

accaccgccctgtacggcggcagccgcatcgagaaagaccatattcgcgtcga 

ggcctacggcaccgtcgatgaactgatatcccagctgggcgtctgctacgcca 

cgacccgcgacgccgggctgcgggaaagcctgcaccatattcagcagacgct 

gttc gtgctggggg ctgaactggccagcgatgcgcg gggcctgacccgcctg 
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FIG. 6D 

AGCCAGACGATCGGCGAAGAGGAGATCACCGCCCTGGAGCGGCTTATCGACC 
GCAATATGGCCGAGAGCGGCCCGTTAAAACAGTTCGTGATCCCGGGGAGGAAT 

ctcgcctctgcccagctgcacgtggcgcgcacccagtcccgtcggctcgaac 

gcctg ctgacggccatggaccgcgcgcatccgctgcgcgacg cgctcaaacg 

ctacagcaat03cctgtcggatgccctgttctccatggcgcgaatcgaagaga , 

ctaggcctgatgcttgcgcttgaagtggcctaggaaacacagaaaaaagcccg 

cacctgagagtgcgggci i j \ r i l i cctaggcgatctgtgctgtttgccacgg 

tatgcagcaccagcgcgagattatgggctcgcacgctcgactgtcggacggg 

ggcactggaacgagaagtcaggcgagccgtcacgcccttgacaatgccacatc 

ctg ag caaataattcaaccactaaacaaatcaaccgcgtttcccgg ag gtaacc 

aagcttcaccttttgagccgatgaac^atgaaaagatcaaaacgatttgcagtac 

tggcccagcgccccgtcaatcaggacgggctgattggcgagtggcctgaaga 

ggcgctgatcgccatggacagcccctttgacccggtctcttc/vgtaaaagt^ 

acaacggtctgatcgtcgaactggacggcaaacgccgggaccagtttgagatg 

atcgaccg atttatcgccg a1tacgcgatcaac gttgagcg gacag agcaog c 

aatgcgcctggaggcggtggaaatagcccgtatgctggtggatattcacgtca 

GCCG GGAGGAGATCATTGCCATCACTACCG CCATCACGCCGGCCAAAGCGGTC 
GAGGTGATGGCGCAGATGAACGTGGTGGAGATGATGATGGCGCTGGAGAAGAT 
GCGTGCCCGCCGGACTCCCTCCAACCAGTGCCtt^ 

ATCCGGTGCAGATTGCCGCTGACGCCGCCGAGGCCGGGATCCGCGGCTTCTC 

AGAACAGGAGACCACGGTCGGTATCGCGCGCTACGCGCCGTTTAACGCCCTGG 

CGCTGTTGGTCGGTTCGCAGTGCGGCCGCCCCGGCGTGTTGACGCAGTGCTC 

GGTGGAAGAGGCCACCGAGCTGGAGCTGGGCATGCGTGGCTTAACCAGCTAC 

GCCGAGACGGTGTCGGTCTACGGCACCGAAGCGGTATTTACCGACGGCGATGA 

TACGCCGTGGTCAAAGGCGTTCCTCGCCTCGGCCTACGCCTCCCGCGGGTTGA 

AAATG CGCTACACCTCCGGCACCGGATCCG AAGCG CTG ATGGGCTATTCGGAG 

AGCAAGTCGATGCTCTACCTCGAATCGCGCTGCATCTTCATTACTAAAGGCGCC 

GGGGTTCAGGGACTC5CAAAACGGCGCGGTGAGCTGTATCGQCATGACCGGCG 

CTGTGCCGTCGGGCATTCGGGCGGTGCTGGCGGAAAACCTGATCGCCTCTATG 

CTCGAGCTCGAAGTGGCGTCCGCCAACGACCAGACTTTCTCCCACTCGGATATT 

C^CCGCACCGCGCGCACCCTGATGCAGATGCTGCCGGGCACCGACTTTATTTT 

CTCCGGCTACAGCGCGGTGCCGAACTACGACAACATGTTCGCCGGCTCGAACT 

TCGATGCGGAAGATTTTGATGATTACAAC^TCCTGCAGCGTGLACCTGATGGTTG 
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FIG. 6E 



ACGGCGGCCTGCGTCCGGTGACCGAGGCeGAAACCATTGCCATTCGCCAGAA 

AGCGGCGCGGGCGATCCAGGCGGTTTTCCGCGAGCTGGGGCTGCCGCCAATC 

GCCGACGAGGAGGTGGAGGCCGCCACCTACGCGCACGGCAGCAACGAGATGC 

CGCCGCGTAACGTGGTGGAGGATCTGAGTGCGGTGGAAGAGATGATGAAGCG 

CAACATCACCGGCCTCGATATTGTCGGCGCGCTGAGCCGCAGCGGCTTTGAGG 

ATATCGCCAGCAATATTCTCAATATGCTGCGCCAGCGGGTCACCGGCGATTACC 

TGCAGACCTCGGCCATTCTCGATCGGCAGTTCGAGGTGGTGAGTGCGGTCAAC 

GACATCAATGACT ATCAGGGG CCGG GGACCGGCTATCG C ATCTCT G CCG AACG 

CTGGG CG GAGATCAAAAATATTCCGGGCGTGGTTCAG CC CGACAC CATTGAAT 

AAGGCGGTATTCCTGTG CAACAGACAACCCAAATTCAGCC CTCTTTTACCCTG A 

AAACCCGCGAGGGCGGGGTAGCTTCTGCCGATGAACGCGCCGATGAAGTGGT 

GATCGGCGTCGGCCCTGCCTTCGATAAACACCAGCATCACACTCTGATCGATAT 

GCCCCATGGCGCGATCCTCAAAGAGCTGATTGCCGGGGTGGAAGAAGAGGGG 

CTTCACGCCCGGGTGGTGCGC^TTCTGCGCACGTCCGACGTCTCCTTTATGGC 

CTGGGATGCGGCCAACCTGAGCGGCTCGGGGATCGGCATCGGTATCCAGTCG 

AAGGGGACCACGGTCATCCATCAGCGCGATCT GCTGCCGCT GAGCAACCTGGA 

GCTGTTCTCX^C^GGCGCCGCTGCTGACGCTGGAGACCTACCGGCAGATTGGCA 

AAAACGCTGCGCGCTATGCGCGCAAAGAGTCACCTTCGCCGGTGCCGGTGGTG 

AACGATCAGATGGTGCGGC«3AAATTTATG^ 

AAAGAGACCAAACATGTGGTGCAGGACGCCGAGCCCGTGACCCTGCACATCGA 

CTTAGTAAGGGAGTGACCATGAGCGAGAAAACCATGCGCGTGCAGGATTATCC 

GTTAGCCACCCGCTGCCCGGAGCATATCCTGACGCCTACCGGCAAACCATTGA 

CCGATATTACCCTCGAGAAGGTGCTCTCTGGCGAGGTGGGCCCGCAGGATGTG 

CGGATCTCCCGCCAGACCCTTGAGTACCAGGCGCAGATTGCCGAGCAGATGCA 

GCGCCATGCGGTGGCGCGCAATTTCCGCCGCGCGGCGGAGCTTATCGCCATT 

CCTGACGAGCGCATTCTG GCTATCTATAACGCGCTGCGCCCGTTCCG CTCCTC 

GCAGGCGGAGCTGCT GGCG ATCGCCGACGAGCT GGAGGACACCTGGCATGCG 

ACAGTGAATGCCGCCTrTGTCCGGGAGTCGGCGGAAGTGTATCAGCAGCGGCA 

TAAGCTGCGTAAAGGAAGCTAAGCGGAGGTCAGCATGCCGTTAATAGCCGGGA 

TTGATATCGGCAACGCCACCACCGAGGTGGCGCTGGCGTCCGACTACCCGCAG 

GCGAGGGCGTTTGTTGCCAGCGGGATCGTCGCGACGACGGGCATGAAAGGGA 

CGCGGGACAATATCGCCGGGACXCTCGCCGCGCTGGAGCAGGCCCTGGCGAA 

AACACCGTGGTCGATGAGCGATGTCTCTCGCATCTATCTTAACGAAGCCGCGCC 
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FIG. 6F 

ggtgattqgcgatgtggcqatggagaccatcaccgagaccattatcaccgaat 

cgaccatgatcggtcataacccgcagacgccgggcggggtgggcgttggcgt 

ggggacgactatcgccctcgggcggctggcgacgctgccggcggcgcagtat 

gccgaggggtggatcgtactgattgacga«pccgtcgatttccttgacgccgt 

gtggtggctcaatgaggcgctcgaccgggggatcaacgtggtggcggcgatc 

ctcaaaaaggacgacggcgtgctggtgaacaaccgcctgcgtaaaaccctgcc 

ggtggtggatgaagtgacgctgctggagcaggtccccgagggggtaatggcg 

gcggtggaagtggccgcgccgggccaggtggtgcggatcctgtcgaatccct 

acgggatcgccaccttctt cgggctaagcccggaagagacccaggccatcgtc 

cccatcgcccgcgccctgattggcaaccgttccgcggtggtgctcaagacccc 

gcagggggatgtgcagtcgcgggtgatcccggcgggcaacctctacattagc 

ggcgaaaagcgccgcggagaggccgatgtcgccgagggcgcggaagcgatc 

atgcaggcgatgagcgcctgcgctccggtacgcgacatccgcggcgaaccgg 

gc^cccacgccggcggcatgcttgagcgggtgcgcaaggtaatggcgtccct 

gaccggccatgagatgagcgcgatatacatcgaggatctgctggcggtggata 

cgtttattccgcgcaaggtgcagggcgggatggccggggagtgcggcatgga 

gaatgc cgtcgg gatgggggcgatggtgaaagcggat cgtctgcaaatgcag 

gttatcgcccgcgaacrrgagcgcccgactgcagaccgaggtggtggtgggcg 

GCGTGGAGGCCAACATGGCCATCGCCGGGGCGTTAACCACTCCCGGCTGTGC 

GGCGCCGCTGGCGATCCTCGACCTCGGCGCCGGCTCGACGGATGCGGCGATC 

GTCAACGCGGAGGGGCAGATAACGGCGGTCCATCTCGCCGGGGCGGGGAATA 

TGGTCAGCCTGTTGATTAAAACCGAGCTGGGCCTCGAGGATCTTTCGCTGGCG 

GAAGCGATAAAAAAATACCCG ctggccaaagtggaaagcctgttcagt attcgt 

CACGAGAATGGCGCGGTGGAGTrCTTTCGGGAAGCCCTCAGCCCGGCGGTGTT 

CGCCAAAGTGGTGTACATCAAGGAGGGCGAACTGGTGCCGATCGATAACGCCA 

GCCX:GCTGGAAAAAATTCGTCTCGTGCGCCGGCAGGCGAAAGAGAAAGTGTTT 

GTCVVCCAACTGCCTGCGCGCGCTGCGCCAGGTCTCACCCGGCGGTTCCATTCG 

CGATATCGCCTTTGTGGTGCTGGTGGGCGGCTCATCGCTGGACTTTGAGATCC 

CGCAGCTTATCACGGAAGCCTTGTCGCACTATGGCGTGGTCGCCGGGCAGGG 

CAATATTCGGGGAACAGAAGGGCCGCGCAATGCGGTCGCCACCGGGCTGCTA 

CTGGCCGGTCAGGCGAATTAAACGGGCGCTCGCGCCAGCCTCTAGGTACAAAT 

AAAAAAGGGACGTGAGATGACGTGCC I 111 I 1 CTTGTCTAGAGTACTGGCGAAA 

GGGGGATGTGCTGCMGGCGATTAAGTTGGGTAACGCCAGGGITTTCCCAGTC 
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ACQACGTTGTAAAACGACGGCCAGTGMTTCGA<3CTCGGTACCCGGGGC<3GCC 

GCGCTAGCGCCCGATCCAGCTGGAGTTTGTAGAAACGCAAAAAGGCCATCCGT 

CAGGATGGCCTTCTGCTTAATrTGATGCCTGGCAGTTTATGGCGGGCGTCCT 

CCGCCACCCTCCGGGCCGTTGCTTCGCAACGTTCAAATCCGCTCCCGGCGGAT 

TTGTCCTACTCAGGAGAGCGTTCACCGACAAACAACAGATAAAACGAAAGGCCC 

AGTCTTTCGACTGAGCCTTTCGTTTTATT^ 

CATGGGGAGACCCCACACTACCATCGGCGCTACGGCGTTTC^CTTCTGAGTTC 

GGCATGGGGTCAGGTGGGACCACCGCGCTACTGCCGCCAGGCAAATTCTGTTT 

TATC^GACCGCTTCTGCGTTCTGAT^^A^TCTGTATCAGGCTGAAAATCTTCTCT 

CATCCGCCAAAACAGCCAAGCTTGCATGCCTGCAGCCCGGGTTACCATTTCAAC 

AGATCGTCCTTAGCATATAAGTAGTCGTCAAAAATGAATTCAACTTCGTCT 

CGGCATTGTAGCCGCCAACTCTGATGGATTCGTGGTTTTTGACAATGATGTCAC 

AGCGTTTTTCCrrTTAGGAAGTCCAAGTCGAAAGTAGTGGCAATACCAATGATC^ 

ACAACCGGCGGCTTTTCCGGCGGCAATACCTGCTGGAGCGTCTTCAAATACTAC 

TACCTTAGATTTGGAAGGGTCTTGCTCATTGATCGGATATCCTAAGCCATTCCTG 

CCCTTCAGATATGGTTCTGGATGAGGCTrACCCTGTTTGACATCATTAGCGGTA 

ATGAAGTACTTTGGTCTCCTGATTCCCAGATGCTCGAACCATTTTTGTGCCATAT 

cacgggtaccg gaagttgccacagcccatttct c r T r I GGTAGAGCGTTCAAAG 

CGTTGCACAGCTTAACTGCACCTGGGACTTCAATGGAI \ I n CACCGTACTTGA 
CCGGAATTTCAG CTT CT AATTTGTTAACATACTCTT CATTGGCAAAGTCTG GAG C 
GAACTTAGGAATGGCATCAAACGTTCTCCAACC^ 

TTCAGCATCGAAATAAGGTTTGTCCTTACCGAAATCCCTCCAGAATGCAGCAAT 

GGCTGGTTGAGAGATGATAATGGTACCGTCGACGTCGAACAAAGCGGCGTTAA 

CTTTCAAAG ATAGAGGTTTAGTAGTC AATCCC ATAATTCT AGTCT GTfTCCTGGA 

TCCAATAAATCTAATCTTCATGTAGATCTAATTCTTC AATCATGTCCGGCAG GTT C 

TTCATTGG GTAGTTGTTGT AAACG ATTTGGTATACG G CTTCAAATAATGGG AAGT 

CTTCGACAGAGCCACATGTTTCGAACGATTCGTGAACTTCTTTGCAGGTAAT^ 

ACCTTGAGCGGATTGGCCATTCAACAACTCCTTTTCACATTCCCAGGCGTCC^ 

ACCAGAAGTAGCCATTAGCCTAGCAACCTTGACGTTTCTACCACCAGCGCAGGT 

GGTGATCAAATCAGCAACACC^GCAGACTCTTGGTAGTATGTTTCTTCTCTAGAT 

TCTGGGAAAAACATTTGACCGAATCTGATGATCTCACCCAAACCGACTCTTTGG 

ATGGCAGGAGAAGCGTTGTTACCCGAGCCTAGACCTTCGACGAAACCACAACCT 

AAGGCAACAACGTTCTTCAAAGCACCACAGATGGAGATACCAGCAACATCTTCG 
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atgacactaacgtggaaqtaaggtctgtggaacaaggcctttagaaccttatgg 

tcgacgtccttgccctcgcctctgaaatcctttggaatgtggtaagcaactgtt 

gtttcagaccagtgttcttgagcgacttcggtggcaatgttagcaccagataga 

gcaccacattgaatacctagttcctcagtgatgtaagaggatagcaattggaca 

cctrtagcaccaacttcaaaaccctttagacaggagatagctctgacgtgtgaa ■ 

tcaacatgacctttgaattggctacagatacggggcaaaaattgatgtggaatg 

ttgaaaacgatgatgtcgacatccttgactgaatcaatcaagtctggattagc^ 

ccaaattgtcgggtagagtgatgccaggcaagtatttcacgttttgatgtctag 

tatttatgatttcagtc aa' i i i \ i caccattgatctcttcttcgaacacccacatt 

TGTACTATTGGAGCGAAAACTTCTGGGTATCCCTTACAATTTTCGGCAACCACCT 

TGGCAATAGTAGTACCCCAGTTACCAGATCCAATCACAGTAACCTTGAAAGGCT 

TTTCGGCAGCCTTCAAAGAAACAGAAGAGGAACTT 

GTG G C CG GAAGTTAAGTTTAATCTATCAGCAG CaG CAG CCATGGAATTGTCGTC 

CTTACTAGTCATGGTCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCAC 

ACATTATACGAGCCGGATGATTAATTGTCAACAGCTC^TTTCAGAATATTTG 

GAACCGTTATGATGTCGGCGCAAAAAACATTATCCAGAACGGGAGTGCGCCTTG 

AGCGAC^CGAATTATGC^GTGATTTACGACCTGCAC^GCCATACCACAGCTTCC 

G ATGG CT G CCTGACG CCAGAAGCATTGGTGCACGCTAGCC^GTAC ATTTAAATG 

GT ACCCTCT AGTCAAGGCCTTAAGTGAGTCGTATTACGGACT GGCCGTCGTTTT 

ACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGC 

ACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCC 

CTTCCCAACAGTTGCGC^GCCTGAATGGCGAATGGCGCCTGATGCGGTATTTTC 

TCCTTACGC^TCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGT ACAAT 

(TrGCrrCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTG 

ACGAGCT 
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